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© A yeast cell of the genus schwanniomyces. 



© The present invention relates to a yeast celt of the genus Schwanniomyces wherein said yeast cell contains 
at least one expression cassette comprising: 

a) a first DNA sequence serving as a regulon, 

b) optionally a second DNA sequence coding for a signal peptide, 

c) a third DNA sequence coding for a foreign protein, and 

d) optionally a fourth DNA sequence serving as a terminator. 

in a preferred embodiment of the invention, the first, second and/or fourth DNA ^^"^ 

genes coding for amyloiytic enzymes of yeast ceils of the genus Schwanniomyces. The ^^J^^ 
sequences in an expression cassette provides efficient expression of foreign genes in yeast celfc ^accora^ng to 
the present invention and, furthermore, provides a system for efficient secretion of the express.on product .mo 
the medium. 
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The present Invention relates to yeast cells of the genus Schwanniomyces and a process for the 

production of polypeptides. 

For expression of eukaryotic proteins of biotechnogical relevance there exists a variety of host systems, 
which are selected in compliance with the specific requirements. The most frequently used host «ganwnw 
Escherichia coli, being the best known representative of prokaryotic host orgamsms; Saccharomyces 
ISvTsSlnTanimai tissue cultures are used as eukaryotic host celis. For certain purposes e.g. where 

s iSeS of r glycoprotein is desired, expression in E^coli or other prokaryotic orgamsms is no 
Xp atl since'Le lack any glycosylation capability. On the other hand, the mass ^onoi 
protein in issue cultures still is cumbersome and expensive; therefore the organ-sm of cho.ce ,r many 
Zees is yeast of the well characterized genus Saccharomyces . S. cerevjsjae ,s able to secrete 
efSnS small proteins, e.g. polypeptides like MG1 (Kurjan and Herskowitz. 1982). 0-endorph.n or «- 

„ I! Sr e P . al.. 1984? however proteins of higher molecular weight are <*™*£*£r» 
srSce (Emr et al. 1981) or the cytoplasma. thus causing decreased growth rates or even death of the ce I. 

spacevcmroiai, ioo ; j w e «„ imomuari wikaryotir; exoression svstem. 

Therefore it is an object ot ine preseiu hivwiuum iu H 'w«~ s - ■ - • • 

which is optionally capable of secreting proteins encoded by a foreign DNA into the culture ™*™ 

This object has been solved by a yeast cell of the genus Schwann .omyces, where.n said yeast cell 

75 contains at least one expression cassette comprising : 

a) a first DNA sequence serving as a regulon. 

b) optionally a second DNA sequence coding for a signal peptide 

c) a third DNA sequence coding for a foreign protein, and 

d) optionally a fourth DNA sequence serving as a terminator. ^ . ^ annar hu rhP 
The invention, which comprises further subjects, is now described in a more detailed manner by the 

following description, examples and figures. 
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BRIEF DESCRIPTION OF THE FIGURES 

R A 1 Restriction map of the AMY1 gene encoding ^-amylase and its 5 and a' # noncoding region 
B- Nucleotide sequence of the AMY1 gene and its flanking region. In the 5 region the transenpbon 
initation site istSeo oy a dashed line above the sequence. Within the structural gene the presumptive 
^pJlJJoSw are indicated by star-symbols. The two potential glycosylates, the 
^1SZ »JL» forminp four disulfide bonds (concluded by ^^J%^ ^ 
enzyme) are underlined. Within the 3' end region the transcription terminate s.gnal TAQ...TATGT...TTT » 
indicated by a solid line. 

R A 2 Restriction map of the GAM1 gene encoding glucoamylase and its 5 and 3' noncoding region 
B- NuSde sequence of the GAM1 gene and its flanking region. In the 5 region the Mnecnptar. 
initiation site is indicated by a dashed line. Within the structural gene the presumptve s,gna. peptides 
cleavage sites are indicated by star-symbols. 

M13 EcoR.-Scal fragment, containing a 1.8 kb promoter *!^^ t ^^*j2j 

gene fragment of a-amylase gene, was subcloned into M13mp8 prev.ously d.gested w.th EcoRI and Smal. 
By site directed mutagenesis a Xhoi-site was inserted just in front of the ATG codon: 



** 



45 51. .TAAAATAAAAGCTCGAG A JG_A_G_A TJ_T ._ . ^JJ. . 



Positions where bases have been exchanged by site directed mutagenesis, are indicated by asterisks. 

50 oMaoSis The Bglll fragment containing the glucoamylase gene and its 5 and 3 flanking region was 
2SS the Ba^nH. sfte of vector P MAC5-8 (Kramer et al.. 1984). By site directed mutagenesis us ng 
duplex r^A methods (Kramer et al.. 1984) a BamH. site and a Sal. site were mserted ,ust m 
front of the translation initiation codon ATG: 
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* ****** ** 

5« .. CTCAT6ACT6TGTCGACG6ATCCAAGAT_G.AIU_IU«---i:- 

Positions affected by the site specific mutagenesis are indicated by asterisks. 

YRp7aQAM. The GAM gene was isoiated T P™* — S'SX^ ~ 
SS il^n^B^^ — ' YRP706AM - 

Construct of * occidentaiis vectors for expression of foreign genes using the promoter and the signa. 

^aKftT^ fragment fro. ^SS^SSS. 
GAM promoter and the first 208 bp of the GAM encod.ng sequence mto P RR322. cleaved w.th 

msultina in vector pBRGAM. , ^^ xeno ^ntaininn the structural gene 

The resulting plasmid, pMPGCVl contains o*i uy> 

to QAM coding njgM ^?<»^£Z££Z£S^ pBBQCI Wo BamHI/Pstl cot pCJDS- 
, ™ ,^rP*OC.™ «jS- » ce* 9 „0 * « » * 

of the GAM coding region fused to position + 112 of the celD. 

Promotes using the o.ua,ita«ve endog.ucanase 0 activity assay (congo red string) with different 

NG^MPAGC. NGA58: pMPPGC. NGA58: pMPGC1-1, NGA58: pMPGC1-2 
a: Growth on 2% maltose 
b: Growth on 4% glucose 

^SSSXSZZS » to Sfl, » 0- PMPADH, — o ,n 

^c: B.tano. 0. «» OA» P^T * ^ P»^<? ^ " " ^ 
pkffM pMPAG By in «t»o recombinaion. ,.so»in 9 in pl»m,d pMPAOC. 

ConarSo'n o. . lc.de*:!* »e«o, « to - «0" — — — » PDC 

plasmid pMPPG by in vivo recombination, resulting in plasmid pMPPGC. 

F, 9- 10 ,■ „ ,«-tnr (mntainina SwARS2) for transformation of S. occidentaiis. 

Construction of an autonomous repl.catong vector ^^^J^ the 5 . 5 kb BamHhBglll fragment 
The 3.2 kb TRP5 BamHI fragment from plasm.d J35pT8*B. 
of plasmid pBRSwARSGAM (Rg 6a), resulting in pMPTS2-A and pMPi u 

ABBREVIATIONS USED THROUGHOUT THE APPLICATION 
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Restriction endonucleases : 

Aa-Aatil 
A - Asp718 
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B - BamHI 

Bg - Bglll 

C - Clal 

E * EcoRI 
5 EV - EcoRV 

H - Hindlll 

P - Pvull 

Ps - Pstl 

S-Sall 
10 Sc - Seal 

Sm - Smal 

Sp - Sphl 

X - Xhoi . . # . 

Throughout this application various publications are referenced by mention of the first author and the 
year of publication within parentheses. Full citation of the references may be found at the end of the 
SnArificatton as an annex, listed according to their alphabetical order immediately following the specifica- 
tion The disclosures of these publications in their entireties are hereby incorporated by reference into this 
application in order to more fully describe the state of the art as known to those skilled therein as of the 
date of the invention described and claimed herein. 

The yeast molecule according to the present invention is characterized by containing an expression 
cassette which in turn is composed of three to four different components, being arranged in operable 
linkage in order to ensure efficient expression of a desired polypeptide, optionally followed by secretion of 
same. These components are defined to be the following : 

a) A first DNA sequence serving as a regulon. The term tt a regulon" as used throughout the 
application comprises any cis-acting DNA sequence involved in the regulation of expression of a given 
structural gene. The term thus embraces sequences preceeding a given coding sequence, for example, a 
promoter and sequences which are recognised by transcription factors or other proteins involved in 
transcription regulation. The "first DNA sequence" is thus not limited to sequences corresponding to known 
promoters. Also included are DNA sequences corresponding to natural occuring regulons. which have 
undergone insertion, deletion or substitution events, but still retain their activity as regulon. Further the term 
"regulon" includes DNA sequences involved in regulation of transcription, which have been synthesised 
chemically or obtained by methods of gene technology. 

b) An optional second DNA sequence coding for a signal peptide. The meaning of this second DNA 
sequence includes various DNA sequences, directing the secretion of any polypeptide containing the thus 

35 encoded peptide at its N-terminal end. 

c) A third DNA sequence coding for a foreign protein. This coding sequence coding for the signal 
peptide in order to retain a proper reading frame for the third DNA sequence. The fusion thus requires 
application of methods or recombination DNA technology known to every person skilled in the art. For 
methods enabling to design proper fusions it is in general referred to Maniatis et al. (1982), wherein 
appropriate methods, e.g. various ways to treat restriction fragment ends, cloning of linker molecules and 
similar procedures are described in great detail. 

The foreign protein coded for by the third DNA sequences may be any protein which to express there 
exists some need. Since the secretion system is capable of secretion not only of small molecules, but a so 
of large proteins, expression of any desired protein should be possible. For examples it is referred to the 

45 discussion below. t 

d) Optionally a fourth DNA sequence serving as a terminator. As the terminator any sequence may 
be used, which efficiently terminates transcription in the respective host organism. Terminators may be for 
example sequences prone to formation of hairpin structures and/or polyadenylation sites. In a preferred 
embodiment the terminator is derived from the same gene as the second DNA sequence, i.e. the regulon. is 

e The four components are combined with each other using known techniques. Optionally the proper 
fusion will be verified for example by sequencing the boundaries. 

The yeast cell according to the present invention contains at least one expression cassette comprising 
some or all of the above mentioned DNA sequences. It provides a convenient tool for the industrial 

55 expression of any desired protein. 

In a preferred embodiment said first and/or second and/or fourth DNA sequence are derived from a 
gene coding for an anxiolytic enzyme, for example c-amylase or glucoamylase. which enzymes in most 
organisms are induced upon contact with starch. The DNA sequences may be derived from a yeast of a 
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of on. of lha oooor. Debanomyces . Saccf.arc.nw Upomycaa. Ptchlj. or Sacclwctriycas , but 

onJCm^rgl Z „a,a»loca,bons. urca. Star* fa hydrofyaad to gbew by 
SyS^ateo Ho tha SW .rn.«nt. nan«l, «-a™»l«s. ta- l+#*m 

35 ^^ T Sa' 9 aS' S2 S5 — ^ bay. baan 

^as^^ it is a.so possibie to use probers derived from yeast 

^Scl^ 

— ent seque " CeS ° f Ph39e 77 - USed ' Wh ' Ch 
regulated by expression of T7 RNA polymerase. fllrthermore in order to facilitate purification of 

mmmmm 

Ar9LeU c ?Mett^ ValSerAla; 

2 mS ^ ValSerAlalie6lnA1aA.aProA.a; 
d) Metner Metl , e p ne L e uLysL e ulleLysSerlleVallLeGly^uGlyLeu ValSerAla 

,ie T™^ « — - * ™ -** 

^T^GA^ 

QGTA b^^QAQATTTTCMCTQMGGATTTACAAGTAMSTTQTTGCAQCMTTT TAGCATTCTCAAGATT- 

CAG Ta?gaWctgaagctgattaamgtatagtaa^ gttagtgctatccaag- 

CAGCCCCTGCCTCTTCGATTGGATCTAGTGCTTCAGCA. 
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It will be understood, that the replacement of amino acids by residues having comparable properties 
and use of alternative codons in compliance with the possibilities provided by the genetic code will not have 
any impact on the yeast cell according to the present invention. 

The polypeptide to be expressed in a yeast ceil according to the present invention is coded for by a 
third DNA sequence. Said third DNA sequence may comprise natural or synthetic DNA, even DNA 
containing intervening sequences. Examples for proteins being expressed by the yeast cell according to the 
present invention comprise proteins of enormous importance for scientific or medical purposes, for example 
cellulase, interleukine, interferon, insulin like growth factor, lymphokine, human growth factor, nerve growth 
factor, aprotinin. insulin, hirudin, hormones, blood clotting factors, hepatitis B surface or core antigens, viral 
or bacterial vaccines or human granulocyte/macrophage colony stimulating factor. 

The yeast cell according to the present invention may optionally comprise a terminator as component of 
said expression cassette. Preferred terminators are derived from genes of one of the following genera : 
S accharomyces , Pichia . Hansenula , but preferably Schwanniomyces . Examples of preferred terminators are 
those derived from Schwanniomyces a-amylase gene or Schwanniomyces glucoamylase gene, part or all of 
which my be used. Such terminators are comprised by nucleotides 1537 to 1740 of the Schwanniomyces a- 
ar7Vy ,| a3S g8ne or nucleotides 2S75 to 3320 of the Schwanniomyces glucoamylase gene. 

It is obvious, that in case of any of the DNA sequences mentioned modifications, which do not impair 
the biological function of the respective DNA fragment still are covered by the present application. There 
are numerous possibilities to change the DNA sequences mentioned above slightly without impairing the 
biological function of said fragment gene. 

The yeast cell according to the present invention, carrying an expression cassette composed of the 
above discussed components, may be carried by a circular or linear vector, which optionally comprises one 
or more selective marker genes. The present inventors demonstrated, that a lot of marker genes so far used 
for transformation of Saccharomyces , for example TRP5, LEU2. ADE1, ADE2. HIS3. HIS4, URA3. LYS2, 
which may be obtained from Saccharomyces , Hansenula , Pichia or Schwanniomyces , are functional tn 
Schwann iomyces occidentalis . Furthermore it is also possible to use the amylolytic properties ofo-amylase 
or glucoamylase in order to select transformed organisms. The corresponding genes have been isolated 
and characterized. AMY1 and GAM1 are preferably obtained from Schwanniomyces . 

The yeast cell according to the present invention may contain the expression cassette as an insert into 
one of the endogenous yeast chromosomes. Furthermore, there is however the possibility, that a DNA 
sequence capable of controlling autonomous replication and even also stable maintenance of the vector in 
the host organism, is introduced into the yeast cell. Said DNA sequences are the so called ARS sequences, 
capable of controlling autonomous replication and stable maintenance in the respective host organism. Said 
DNA sequences are preferably derived from Saccharomyces , Pichia Hansenula or Kluyveromyces . most 
preferred from Schwanniomyces . 

In order to ensure effective replication in the prospective host organism, it is preferred to use an ARS, 
derived from the respective host organism. 

The present invention provides one new SwARS sequence. The DNA sequence SwARS2 is capable of 
controlling autonomous replication and stable maintenance of the vector in a Schwanniomyces yeast cell. 
The detailed sequence of SwARS has not yet been determined. SwARS2 is localised in the Schwan- 
niomyces genome in the 3' region of the glucoamylase gene, as shown for example in Fig. 10. SwARS2 is 
easy to id entify and efficiently allows autonomous replication of any plasmid. exhibiting this sequence. The 
so far known Schwanniomyces autonomously replicating sequences, SwARSI and SwARS2 are composed 
on a Clal-BamHI fragment (SwARSI) adjacent to the EcoRI-fragment shown in Fig 2a (see Fig. 10) and/or 
an EcoRI-Bglll fragment contained in the EcoRI fragment shown in Fig 2a (SwARS2). 

For the SwARS sequences it is also not necessary, in any case to use the sequence as it occurs 
naturally DNA sequences, that have been obtained by any kind of modifying treatment, for example by 
deletion, insertion or substitution of one or more nucleotides or base pairs, are still within the scope of the 
present invention, provided the modifications retain or Improve the biological function of the yeast cell 
according to the present invention. 

In a preferred embodiment the yeast cell according to the present invention comprises a DNA 
sequence, which is homologous to genomic Schwanniomyces DNA. Provision of structures of homologous 
DNA enable the DNA fragment flanked by such homologous sequences to be inserted by homologous 
recombination. It is thus preferred to have homologous sequences, which do not represent any essential 
gene of the Schwann iomyces genomic DNA, but rather any gene, whose expression product may be 
supplemented by additives to the respective medium or were lack of the respective expression product 
does not have any serious impact on the transformed yeast cell. 

If no SwARS sequence or any equivalent sequence, for example originating from any other yeast 
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soecies is available, the expression cassette can be used without an autonomously replicating agent. In this 
case the expression cassette tends to integrate into the genome, for example via homologous recom- 
bination Preferably there are several copies inserted into one or more chromosomes. 

In this case it is preferred, to have multiple copies of the expression cassette inserted, in order to. 
s ensure simultaneous transcription and translation of several copies. ,«„„«„,„ 

When the yeast cell according to the present invention is derived from a yeast of the genus 
Schwanniomyces the polypeptide products observed following expression of the respective foreign gene m 
Sd v^TeTe xhibit glycosylate patterns, which closely resemble or even match those of the desired 
eucaioTc pTote^ Yeast ceJs of the genus Schwanniomyces are thus preferred for ex P ress.on of 

10 "^rabora-iscussed yeast cell according to the present invention can be used in a process for the 
□reduction of a polypeptide, wherein a yeast host organism is cultivated under suitable conditions and the 
it^ ^Zl^ in a manner known per se. The conditions of cultivation depend on the respective 
So " onanism used. Examples for cultivatiolTcirnditions for yeast of the genus Schwanmomyces are 
provided in the examples below. The polypeptide, produced by the yeast according to the present 
T 1° „ , h >. , nnue n.inn a i ratification methods, which depend on the properties of the 

IMVeilUUM, I© iwwv^tSv. ~j — .. * 

expressed protein, the availability of specific absorbing agents and so on. 

In a preferred embodiment yeast cells of the species Schwanniomyces occdentahs are used as a host 
organism and are cultivated on starch.dextrin. maltose and/or plant biomass as the respect,ve carbon 
source. As commonly known, these carbon sources are available at low costs. nr „ Hl , rtin „ r, f 

Furthermore, there is a possibility to use cells of the genus Schwanniomyces for the production of 
sinole cell protein. The single cell protein may then be subjected to any further use as rt is requ.red. 

t a prKd embodiment the polypeptide produced in a process according to the present mvenfon ,s 
an amylolytic enzyme, for example an a-amylase or glucoamylase. These enzymes are commercially used 
to debranS Td degrade starch which thereupon may be further processed, for example .n brewery or 
baking processes. There is a high need for the provision of starch degrading enzymes. 

in a further embodiment of the process according to the present inventor , product.cn of a l^yP^de 
may be achieved by providing expression cassettes as discussed above containing a yeas -amylase or 
olucoamylase signal sequence and inserting the respective expression cassette into a yeast of one of the 
^^S^S^r^W!^ Hansenula . Pichia or Schizosaccharomyces . Productton and secre- 
Son of foSgn pTote^s l^l^^^UolT^^ is obtained by applying the appropna e 
nducfive measure whereupon the produced protein is secreted due to the presence of the yeast a-amylase 
T£Z£^£* sequence. Since it is known from experiments performed by the present mventor, 
L?mesTsig^rse 9 quences are recognised in virtually any yeast genus, the a-amylase or 
sianaVsequences may be applied in many different yeast with success. Also in this case, it .s preferred, to 
uTlTISSS^ or glucoamylase signal sequences derived from the respective genes of S^hwan- 

"Ts^rtney east cells according to the present invention are well suited for transformation and 
expression of any "esired foreign gene . As already outlined above, expression of procaryotic as well as 
eucarS genes" is possible. However, for expression of eucaryotic proteins yeast cells of the genus 
Schwlniomyces containing some or all of the DNA sequences mentioned above are the most preferred 
yeast cells. 
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CHARACTERISATION OF THE a^MYLASE AND GLUCOAMYLASE GENE DEPRESSION IN DIFFERENT 
YEAST GENERA - : 

The N-termini of the deduced amino acid sequences of the «-amylase gene (Fig 1B) and glucoamylase 
„*J?Fin ST exhibit the features of typical leader sequences of secretory protems (Von He.jne 1983. 
SSJ^S^iS? ™e N-telna. amino acids of mature a-amylase secreted by S. occiden- 
LTana dTtermS to be AspValSer. This indicates processing between Arg33 and As P 31 wh.ch ,s unusual 
acting to the Von Heijnes model of site specificity for signal peptidase cleavage Th.s putative 
ceavaoe sKe ^ wi confirmed also for a-amylase expressed in S. cerevisiae . Pichia stjpjchs.K. lactis, S 
oombTand H. potymorpha . However, in these yeast species processing is a.so found to occur between Ala 
25 and Gin 26. resulting In the N-terminal sequence GlnProlle. 

a-amvlase isolated from a culture of S. occidental migrates as a single band of 55 kDa on 80S 
oolvacSde gete: following Endoglucosidise H treatment and subsequent gel electrophoreses a sing e 
2J!mJ£ * observed, fndicating that only one of the two potential N-glycosylation s,tes of the protein 
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is used for glycosylation. It is noteworthy that high mannose glycosylation, as very often found by proteins 
secreted in S. cerevisiae is not accompanying secretion in S. occidentals. 



IDENTIFICATION OF THE TRANSCRIPTION INITATION SITES AND PROMOTER FUNCTION OF THE a- 
AMYLASE AND GLUCOAMYLASE GENE. 

By S1 mapping of polyA-mRNA (Sharp et al.. 1980) the transcription initiation site of the a-amylase 
gene could be localised to be in the region of -42 to -30 (indicated by an interrupted line above the relevant 
sequence in Rg.1B) and the transcription initiation site of the glucoamylase gene was determined to be in 
the region -7 to -10 (underlined in Fig.2B). 



CONSTRUCTION OF AN a-AMYLASE - GLUCOAMYLASE HYBRID GENE 

J5 

In order to facilitate use of the a-amylase promoter and glucoamylase structural gene in further 
constructions additional restriction sites were inserted in front of the translation initiation codon of both 
genes using oligonucleotide directed mutagenesis as described in the AMERSHAM protocol. Correct 
mutagenesis was verified by sequencing according to Sanger (1977). The boundaries between the original 
20 Schwanniornyces genes and the inserted polylinker are shown below : 
a-amylase : 

** * 

5 1 . . TAAAATAAAA6CTCG AG ATG_AG ATJJT. . .3 ' . 

25 

_X_hoJ 

Glucoamylase: 

30 

★ ****** ** 

5 ' . . CTCATGACTGTGTCGACGGATC C A AG ATG^TTTJJ[_ ..ij^ ' . 

« Sail BamHI 



The manipulations resulted in plasmids Ml3aXho (Fig.3) and pMaGAM-B/S (Fig.4). 

40 Thus both the a-amylase promoter and the structural glucoamylase gene can be isolated from plasmid 
M13a Xho and pMaGAM-B/S as suitable DNA fragments. The GAM gene was isolated from pMaGam B/S 
as a 3 2 kb Sail fragment and ligated into the Xhoi site of M13aho. The Bglll fragment harbouring the a- 
amylase promoter/GAM gene fusion was ligated into the single BamHI site of the S. cerevisiae vector YRp7 
(Struhl et al.. 1979) resulting in plasmid YRp7«GAM (Fig 5). S. cerevisiae strain YNN27 was transformed 

45 with this plasmid selecting for TRP prototrophy. Transformants secreted active glucoamylase tnto the 
culture supernatant (50 mU/ml) indicating that the a-amylase promoter fragment can direct expression and 
secretion of foreign proteins. 



50 SECRETION OF a-AMYLASE AND GLUCOAMYLASE IN DIFFERENT YEASTS USING SUITABLE PRO- 
MOTERS. 

Expression of the a-amylase and glucoamylase gene and secretion of the gene products can be 
detected in K.lactis under control of the promoter of the a-galactosidase (LAC4) gene (Breunig et al.. 1984). 
55 Expression STthe" a-amylase and glucoamylase gene and secretion of the both proteins can also be 
detected in S. pombe under control of the ADH1 promoter (Russel et al.. 1983) or of the GAL1/10 promoters 
(Johnston a7Td~Davis 1984). After fusion with the GAL1/10 promoter (Johnston and Davis 1984). DHAS 
promoter (Janowicz et aL. 1985). MOX promoter (Ledeboer et al.. 1985) or FMDH promoter (EP 87 110 
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417.0) expression of the --amylase and glucoamylase gene and secretion of both proteins can be detected 
in Hansenula polymorpha . 

EXPRESSION AND SECRETION STUDIES IN S. OCCIDENTALIS USING THE CELLULASE EN- 
SSSSSUbE D <S OF CLOSTRIDIUM THER MOCELLUM AS A MODEL FOR FOREIGN GENE 
EXPRESSION. 

As a first example for the expression of a foreign gene in S. occidentalis, the celD gene from C. 
th ermocellum (Millet et al.. 1985, Joliff et a!.. 1986b). which codes for a thermostable cellulase, namely an 
endoglucanase (EGD) was used. Advantages in using this system are : 

- specific reaction of EGD, which can easily be monitored 

- a rapid qualitative assay for colonies with EGD activity (see material and methods) 

- availability of antibodies against EGD for specific detection of translation products. 

CHARACTERISATION OF THE GLUCOAMYLASE PROMOTER AFTER FUSION WITH THE celD GENE 

Plasmids were constructed containing a replicon from S. occidentalis (SwARSI) (EP 8711037ai) the 
TRPS gene from S. cerevisiae (Zalkin and Yanowski.1982) as a se.ect.ve marker and, in add-on a 
GAM/celD gene fusTon-UndirThe control of different promoters. In the first step the 4.0 kb EcoRI-Pvull 
fragment from plasmid pBRSwARSGAM (Fig. 6a) was isolated and inserted into the 2296 ^bp R PBR322 
EcoRlSuVfragment. containing the amp gene and the bacterial origin, resulting m plasm,d I^W«*M 
(Rg to), In addRion to the P BR322 seqTJSnce this plasmid carries 3.6 kb from the 5 non codmg reg»n oi 
he GAM gene and the first 208 bp coding for the N-terminal part (including the s.gnal sequence) of the 
alucoZlaL A 3 4 kb Pvull fragment from plasmid pCT603 (Joliff et al.. 1986) containing , he coding 
f£Z me cl gene missing the 5 ill bp was inserted into the Pvull single * - J^^lSSS 
n DBRGC1 (F.g.6b). E. coli transformed with this plasmid results in weak Carboxymethylcellulase (CMCase) 
airty analySd VcoTgo red staining (see materia, and methods) indicating that the glucoamylase 
pSer " ^sW expressed in E. coli. For the construction of a ft occiden^ 
^omw nnrwi /fp 87110370 1T was cleaved with BamHl/Pstl and hgated with the 5.5 kb Bglll-Psti 
S^^it^el fcb^L^H^STfiiriiit from pBRGCI. resuKng in P MPGC1-1 (Fig.Sc) and P MPGC1-2 
(Z 6d) respectively Both plasmids share an in frame fusion of GAM/celD but differ ,n the length of the 5 
uostfearn S oTthe GAM gene. S. occidentalis strain NGA58 (tr P 5 ade) isolated after UV mutagenes.s of 
strain NGA23^was transformed ^STVXfo, and transformants selected for 
and analyzed by means of congo red assay for cellulase activity. Transformants produced active EGD when 
glr u £ ilcing conditions in 2% maltose (Fig 7a). By analysis with •^^^.fS 
could be detected under repressed conditions in 4% glucose (F.g. 7b). NGA58.pMPGC1 1 and 
NGA^f P MPGC1 2 transformants secrete active EGD into the culture supernatant ^^J"™"^ 
under induced conditions. The intact or complete glucoamylase promoter res.des on plas mrf P^CI 2 as 
sSown by 20 times higher activity compared to the shorter promoter fragment ,n gene fusion , o 
(table 1) However the 0.3 kb promoter fragment still harbours regulatory units which lead to catabol.te 
e^sionT gTuccS (Fig.7b) Comparison of the N-terminal GAM sequence to the signal peptidase 
deavage siS of sevSa ofter ecretory proteins (Von Heijne 1983) reveals three potential signal peptidase 
cleavSe s S after the amino acid residues Ala 19, Ala 25 and Ala 34. For determination of the N-term.nal 
2Tf Id of fte maturolucoamylase the protein was isolated for N-terminal amino acid sequencing. 
Hoover he N ZSH bSd and therefore could not be sequenced. To confirm the correct cleavage 
site of the GAM sTna! sequence in S. occidentalis , the regions between the potential s.gnal peptidase 
cLagHites Snd I the PvuJgene fusioT sitTirediTeted in the GAM/celD fusion by in vjtro mutagenesis to 
i study the influence on secretion of active EGD. 

STUDIES OF THE USE OF SUITABLE S. CEREVISIAE PROMOTERS IN THE EXPRESSION OF FOREIGN 
GENES IN S. OCCIDENTALIS . 

5 The ADH1 promoter (Hiteeman et al., 1981) was isolated as a 1.45 kb BamHI-Sall fragment from 
n,«Iid nFM2 1 /MulleVet al 1987) and ligated into plasmid P CJD5-1 (EP 87 110370.1) after cleavage with 
BamH, £d ^SallTslg tpMPADHi ( 9 Fig.8a). The GAM gene was excised as a Sail fragment from 
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pJDcg-15 (see below) and inserted into the single Sali site of pMPADHI (Fig. 8b) resulting in plasmid 
pMPAG (Rg. 8b). By the means of in vivo recombination (Ma et al. t 1987) the GAM promoter from 
pMPGC1-2 (Rg 6d.) was replaced by the ADH1 promoter of pMPAG leading to plasmid pMPAGC (Rg. 8c). 
Plasmid pMPGC1-2 was linearized with BamHI; the unique BamHI site of pMPGCl-2 is localized between 
the TRP5 coding sequence and the glucoamylase promoter. The 2.75 kb Clal fragment of pMPAG 
containing the ADH1 promoter shares homologies to the TRP5 gene as well as to the GAM coding region 
up to position +208. For in vivo recombination NGA58 was transformed with BamHI linearized pMPGC1-2 
together with approx. 10 fold excess of Clal fragment using the transformation protocol of Ito et aL, 1983. 

Recombinants were screened for their ability to hydrolyze CMC on 4% glucose. Under these conditions 
celD expression is repressed in transformants harbouring recircularized pMPGC1-2. From 146 transformants 
7 colonies expressing cellulase in the presence of 4% glucose were isolated. NGA58 containing the "in 
vivo" constructed plasmid pMPAGC expresses active EGD constitutiveiy (see Rg. 7). This result dem- 
onstrates that the newly constructed plasmid pMPADHI serves as a vector for foreign gene expression. 

In another experiment a recombinant plasmid was constructed by in vivo recombination to study the 
expression of the GAM/celD gene fusion under control of the PDC promoter (Das and Hollenberg, 1982). 
Therefore the 6.0 kb Sphl-Aval fragment from plasmid pJDcg-15 (EP 87 110370.1) carrying the 
glucoamylase gene fused to the PDC promoter was subcloned into the respective restriction sites of 
YRpJD2 (EP 87 110370.1) resulting in plasmid pMPPG (Rg. 9a). 

For in vivo recombination pMPPG was cleaved with Clal and used together with BamHI linearized 
pMPGC1~2 "^transformation of NGA58. Homologous recombination results in plasmid pMPPGC (Fig. 9b) 
harbouring a GAM-celD fusion under control of the PDC promoter. This construction leads to an expression 
in Schwanniomyces occidentaiis NGA58 inducible by glucose and partly repressable by maltose (Table 1). 
Transformants were analysed for expression of active EGD using the qualitative congo red assay (Rg. 7). 
The results indicate that the 0.3 kb GAM promoter fragment leads to regulated expression, however less 
efficient that the 1.3 kb fragment (for comparison see table 1 and Fig. 2A). Expression under the ADH1 
promoter leads to efficient constitutive expression. The expression level is comparable with that of induced 
expression directed by the 1 .3 kb GAM promoter. 

The fact that plasmids can be constructed by in vivo recombination is a proof for homologous 
recombination and hence offers the possibility for stable site directed integration into the genome of S. 
occidentaiis, a prerequisite for the industrial production of foreign proteins. 

The expression of the ot-amylase gene and glucoamylase gene and secretion of the both gene products 

in S. occ identaiis could also be achieved under control of the phosphoglycerate kinase (PGK) promoter 
(Dobson et al.. 1982). glyceraidehyde-3-phosphate dehydrogenase (GAPDH) promoter (Holland and Holland 
1979), copperchelatin (CUP1) promoter (Karin et al., 1984. Butt et al.. 1984) and alcoholdehydrogenase 2 
(ADR2 or ADH2) promoter (Russell et aL, 1983, Beier et al., 1985) or galactokinase (GAL1/10) promoter 
(Johnston and Davis, 1984). 



Construction of pJDcg-15 : 

The GAM gene was isolated as a 3.2 kb BamHI-Sall DNA fragment from plasmid pMaGAM B/S (Fig.4) 
and inserted into the respective restriction sites of plasmid pBM272 Johnston & Davis. 1984). The PDC 
promoter was isolated from plasmid pCP202 (Kellermann & Hollenberg, 1988) as a Sall-Sphl fragment and 
ligated into the respective single sites of vector YRp7 (Struhl et al.. 1979). To be able to isolate the PDC1 
promoter as a BamHI fragment the 3.2 kb Sail fragment of PMaGAM B/S containing the GAM gene was 
fused to the PDC promoter via the single Sail site. The resulting BamHI fragment containing the PDC- 
promoter was ligated into the single BamHI site of pBM272 containing the GAM gene. The final plasmid is 
called pJDcg-15(Fig. 8b). 



EXPRESSION OF HUMAN GRANULOCYTE/MACROPHAGE COLONY STIMULATING FACTOR (hGM- 
CSF) 

The gene coding for hGM-CSF (Cantrell et al.. 1985) was obtained from British Biotechnology Ltd. as a 
Hindlll-EcoRl fragment Position +52 of hGM-CSF gene was fused to position + 102 of the GAM signal 
sequence under control of the ADH1 or GAM promoter. This fusion results in a hybrid protein in which the 
endopeptidase cleavage site of the GAM signal sequence is linked with a Met included in the synthetic 
gene offered by British Biotechnology Ltd.. followed by the first amino acid of mature hGM-CSF (Ala 18). 
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This fusion was inserted into a S. occidentalis expression vector and transformed into NQA58. Western blot 
analysis revealed secretion of hGM-CSF protein. 

5 EXPRESSION OF I NTERLEUKI NE-2 

The gene coding for IL-2 (Taniguchi et aL, 1983) was obtained from British Biotechnology Ltd., as a 
EcoRI-Hindlll fragment Position +61 of IL-2 gene was fused to position + 102 of the GAM signal 
sequence under control of the ADH1 or GAM promoter. This fusion results in a hybrid protein in which the 
w endopeptidase cleavage site of the GAM signal sequence is linked with a Met included in the synthetic 
gene offered by British Biotechnology Ltd., followed by the first amino acid of mature IL-2 (Ala 21). This 
fusion was inserted into a S. occidentalis expression vector and transformed into NGA58. Western blot 
analysis revealed secretion of IL-2 protein. 

75 

ISOLATION OF AN ADDITIONAL SwARS ELEMENT AND A NEW SELECTABLE MARKER 

A new SwARS sequence (SwARS2), localized downstream of the coding region of the GAM gene, could 
be identified. To analyze the function of SwARS2 a vector was constructed containing the Trp5 gene and 

20 SwARS2 in pBR322. resulting in pMPTS2-A and pMPTS2-B, respectively (Fig. 10). For the construction of 
pMPTS2-A and pMPTS2-B the 5.5 kb BamHI - Bglll fragment of pBRSwARSGAM, constructed by insertion 
of the EcoRI-fragment shown in Fig.2a into the single EcoRl site of plasmid pBRSwARSI, (disclosed in EP 
87 110370.1), was isolated and ligated with 3.2 kb BamHI TRP5 fragment obtained from YRpJD2. High 
frequency transformation of S. occidentalis is obtained using this plasmid. 

25 As an additional selectable marker the LEU2 gene of S. occidentalis was isolated. The LEU2 gene, 
coding for isopropyl malate dehydrogenase, was cloned by functional complementation of the mutant AH22 
(his4leu2) (Hinnen et a!.. 1978) of S. cerevisiae by transformation with a cosmid genomic library from S. 
occidentalis . 

The LEU2 gene was found to reside on a 9 kb EcoRl fragment of Schwanniomyces genomic DNA; it's 
30 function could be abolished by insertion of the TRP5 gene of S. cerevisiae . The disrupted LEU2 gene was 
integrated into the genome of NGA58 by homologous recombination resulting in a new strain NGA581 
(TRP5, LEU2, and ADE). An analogous experiment with the cloned HIS4 gene of S. occidentalis (EP 87 1 10 
370.1) led to a strain NGA58h (TRP5, HIS4, ADE). 

35 

IMPROVED EXPRESSION OF HETEROLOGOUS GENES USING THE T7 RNA POLYMERASE AND T7 
PROMOTER AND TERMINATER FOR THE EXPRESSION OF FOREIGN GENES IN S. OCCIDENTALIS 

T7 RNA polymerase (Fuerst et a!., 1986) can be expressed in S. occidentalis after fusion of the 
40 structural gene (Dunn and Studier, 1984) using the BamHI restriction site near the translation initation codon 
with suitable promoters asa-amylase, GAM (0.3 kb fragment) or PDC and homologous integration into the 
genome via any homologous cloned gene, e.g. HIS4, LEU2, AMY1 and GAM. A suitable fragment, 
containing the fusion flanked by homologous cloned sequences can be isolated and used together with 
pCJD5-1 (Fig. 6c) for co-transformation of NGA58, selecting on YNB containing 0.5% casaminoacids. 
45 Colonies containing the integrated T7 RNA polymerase gene can be identified by Southern analysis after 
isolation of transformants mutated in the target gene. 

For the expression of foreign genes a S. occidentalis expression vector was constructed containing 
SwARS2, the ColE replication origin of E. coli, a suitable selective marker for S. occidentalis (e.g. TRP5, 
HIS4, LEU2) and a cassette containing the T7 polymerase promoter and terminator sequences isolated from 
so plasmid pAR2529 (Fuerst et al., 1986), separated by the polylinker sequence from M13mp19. To study the 
efficiency of the T7 system we expressed a GAM/celD fusion (GAM signal sequence and structural celD 
gene obtained as a BamHI-Asp 718 fragment from pMPGC1-2) under control of the T7 promoter by 
insertion into the BamHI-Asp7l8 site of the polylinker sequence. Using this system active cellulase is 
secreted into the culture medium. 

55 

MATERIALS AND METHODS 
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1. The microorganisms used are as follows : 

Schwanniomyces occidentalis NGA23 : (DSM 3792) 
Schwanniomyces occidentalis NGA58 
5 Kluyveromyces Lactis SD1 1 : (DSM 3795) 

Schizosaccharomyces pombe LEW -32. HIS5-303 (DSM 3796) 
Saccharomyces cerevisiae YNN27 (URA3-52, TRP1-298, GAL2) 

The Escherichia coil strain used is as follows E. coli HB101 F-. hsd S20 (r B - m B -), recA13 ( ara-14 , proA2. 
jo lacYI , galK2, rpsL20 (SmO, xyl-5, mtl-1 . supE44) 



2. Culture media : 

is Schwanniomyces occidentalis strain NGA58 (isolated by UV mutagenesis of strain NGA23) was grown 
at 30 C in YNB (0.67% yeast nitrogen base without amino acids) supplemented with appropriate amino 
acids and with 2% of soluble starch when indicated. 

S. cerevisiae strain YNN27 ( URA3-52 , TRP1-289 , GAL2) (Stinchomb et al 1980); K. lactis strain SD11 
TRPT(Das and Hollenberg, 1982); S. pombe strain LEU1-32, HIS5-303 (obtained from W.D. Heyer): These 
20 slralns were grown under conditions as descirbed above for Schwanniomyces occidentalis with the 
replacement of 2% soluble starch by 2% glucose, where indicated. 

The culture media for S. occidentaiis and Hansenula polymorpha were buffered with 0.2 M NaPO* 
buffer pH 6.2; those for S. cerevisiae and K. lactis were buffered with 0.1 M citrate buffer pH 6 and those for 
S. pombe with 0.05 M acetate buffer pH 6. 
25 ~" The pH optimum for the cellulase (Jotiff et al.. 1986a). a-amylase and glucoamylase (Wilson and 
Ingledew, 1982) is approx. pH 6. 

E. coii HB101 (Bolivar et al., 1977) was grown in LB medium (Maniatis et aL. 1982) with penicillin G 
(150"ug7ml). 

30 

3. Miscellaneous methods 

Plasmid DNA was purified either by CsCI gradient centrifugation (Maniatis et al., 1982) or by rapid 
alkaline extraction of plasmid DNA described by Birnboim and Doly (1979). 

35 Yeast crude extracts were prepared by the method of Schatz (1979). Yeast minilysates were prepared 
by the method of Sherman et al., (1983). 

DNA fragments were isolated by the "low melting agarose" procedure as described by Gafner et al., 
(1983). Southern hybridization was carried out as described by Southern (1975). 

Yeast transformation was performed according to Klebe et al. (1983) or Ito et al. (1983). The 

40 transformants or integrants were tested for expression and secretion of cellulase using either the qualitative 
congo red assay (Teather and Wood, 1982: halo formation on carboxymethylcellulose agar plates) or a 
quantitative cellulase assay according to Joliff et ai. (1986a). One unit (U) is defined as the amount of 
enzyme that released 1 umol para-nitrophenol per min at 60° C. Western analysis was performed as 
described by Towbin et al. (1979). 

45 The transformants or integrants were tested for expression and secretion of a-amylase using either a 
standardized a-amylase enzyme test from Merck, W. Germany or a qualitative starch degradation test (halo 
formation on starch agar plates after staining with iodine), as follows : in this test the rate of formation of 2- 
chloro-4-nitrophenol is determined by photometry at 405 nm in 0.1 M potassium phosphate at 37 C. The 
enzyme unit (U) is defined as the amount of enzyme catalysing the formation of 1 umol 2-chloro-4- 

50 nitrophenol per min at 37* C. 

Glucoamylase activity was measured by a stop assay method: 
After incubation of the sample in 10% soluble starch in 0.05M KH 2 PO*-NaOH (pH 5.0) at 50 C the amount 
of glucose produced is determined by the glucose dehydrogenase method (system glucose Merck). The 
quantity of NADH formed is proportional to the glucose concentration. The enzyme unit (U) is defined as 

55 the amount of enzyme catalysing the formation of 1 umol glucose/min at 50 C. 



12 



EP 0 394 538 A1 



TABLE 1 



Transformant 


ODeoo nm 


EDG-activity 






mU/ml 


NGA58: 


2.3 


0.12 


pMPGCl-1 


5.3 


0.73 


(0.3 kb promoter) 






NGA58: 


2.0 


0.53 


pMPGC1-2 


7.9 


1.83 


(1 .3 kb GAM promoter) 






NGA58: 


3.0 


0.52 


pMPAGC 


5.9 


1,07 


ADH1 promoter 






NGA58: 


2.2" 


0.40* 


pMPPGC 


4.5* 


0.64* 


PDC1 -promoter 


2.9 


0.19 




7.2 


0.31 


Transformants were grown in YNB, 0.5% 


Casaminoacids, 2% maltose, 20 mg/! adenine and 


0.2 M NaP buffer pH 6.2 







* : values for transformants grown in 4% glucose 

25 " 
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Claims 

30 

1 . A yeast cell of the genus Schwanniomyces , 
characterized in that 

said yeast cell contains at least one expression cassette comprising : 
a) a first DNA sequence serving as a regulon, 
35 b) optionally a second DNA sequence coding for a signal peptide 

c) a third DNA sequence coding for a foreign protein, and 

d) optionally a fourth DNA sequence serving as a terminator. 

2. The yeast cell according to claim 1 , 
characterized in that 

40 said first and/or second and/or fourth DNA sequence are derived from a gene coding for an amylolytic 
enzyme. 

3. The yeast cell according to claim 2, 
characterized in that 

said amylolytic enzyme is an a-amylase or glucoamylase. 
45 4. The yeast cell according to any of claims 1 to 3, 
characterized in that 

said first and/or second and/or fourth DNA sequences are derived from a yeast of a member of the genera 
Debariomyces. Saccharomycopses , Lipomyces , Pichia , or Saccharomyces , preferably Schwanniomyces . 

5. The yeast cell according to claim 4, 
so characterized in that 

said yeast is Schwanniomyces occidentalis . 

6. The yeast cell according to any of claims 1 to 5, 
characterized in that 

said first DNA sequence is comprised by a 1.8 kb Bglll-Xhol-fragment proceeding the structural gene 
55 coding for the Schwanniomyces a-amylase or the 1.3 kb BamHI-Pvull fragment proceeding the structural 
gene coding for the Schwanniomyces occidentalis glucoamylase. 

7. The yeast cell according to claim 5, 
characterized in that 
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said first DNA sequence is comprised by part or all of the DNA sequence corresponding to bases -1 to -540 
of the region proceeding the structural gene coding for a-amylase or part or ail of the DNA sequence 
corresponding to bases -1 to -320 of the region preceeding the structural gene coding for glucoamylase. 

8. The yeast cell according to any of claims 1 to 5, 
characterized in that 

said first DNA sequence corresponds to one of the following regulons : 

a) ADH1.ADH2, PDC1, GAL1/10, PGK, GAPDH, the reguion being preferably obtained from Sac- 
charomyces cerevisiae 

b) LAC4 the reguion being preferably obtained from Kluyveromyces lactis , or 

c) corresponding regulons, preferably obtained from Schwanniomyces . 

9. The yeast cell according to any of claims 1 to 5, 
characterized in that 

said first and/or fourth DNA molecule corresponds to a viral promoter and/or terminator, preferably from E. 
coli phage T7. 

10. The yeast cell according to any of claims 1 to 9, 
characterized in that 

said second DNA sequence is coding for all or part of at least one of the following peptides : 

a) MetArgPheSerThrGluGlyPheThrSerLysValvAIAIaAialle LeuAlaPheSerArgLeuValSerAla; 

b) MetArg PheSerThrGluGly PheThrSerLysVal ValAlaAlalle Leu AlaPheSer- 
ArgLeuValSerAlaGlnProllellePheAspMetArg; 

c) MetllePheLeuLysLeulleLysSerlleVallleGlyLeuGlyLeu ValSerAla; 

d) MetllePheLeuLysLeulleLysSerlleVallleGlyLeuGlyLeu ValSerAlalleGlnAlaAlaProAla; 

e ) MetllePheLeuLysLeulleLysSerlleVallLeGlyLeuGlyLeu 

ValSerAlalleGlnAlaAlaProAlaSerSerlleGlySerSerAlaSerAla 

11. A yeast cell according to claim 10, 
characterized in that 

the DNA sequence coding for said peptide corresponds to part or all of one of the following DNA 
sequences: 

a) ATGAGATTTTCAACTGAAGGATTTACAAGTAAAGTTGTTGCAGCAATT TTAGCATTCTCAAGATT- 
GGTATCTGCT, 

b) ATGAGATTTTCAACTGAAGGATTTACAAGTAAAGTTGTTGCAGCAATTT TAGCATTCTCAAGATT- 
GGTATCTGCTCAACCGATTATTTTTGALATGAGA; 

c) ATGATTTTTCTGAAGCTGATTAAAAGTATAGTAATTGGTTTGGGATTA GTTAGTGCT; 

d) ATGATTTTTCTGAAGCTGATTAAAAGTATAGTAATTGGTTTGGGATTA GTTAGTGCTATCCAAG- 

CAGCCCCTGCC; 

e) ATGATTTTTCTGAAGCTGATTAAAAGTATAGTAATTGGTTTGGGATTA GTTAGTGCT ATCCAAG- 
CAGCCCCTGCCTCTTCGATTGGATCTAGTGCTTCA GCA. 

12. The yeast cell according to any of claims 1 to 11, 
characterized In that 

said third DNA sequence coding for a foreign protein is a natural or synthetical DNA sequence encoding a 
cellulase. interleukine, insulin like growth factor, interferon, tymphokine, human growth factor, nerve growth 
factor, aprotinin, insulin, hirudin, hormones, blood clotting factors, hepatitis B surface or core antigens, viral 
or bacterial vaccines or human granulocyte/macrophage colony stimulating factor, 

13. The yeast cell according to any of claims 1 to 12, 
characterized in that 

said fourth DNA sequence originates from a yeast of a member of the genera Saccharomyces , Pichia, 
Hansenula , preferably Schwanniomyces . 

14. The yeast cell according to claim 13, 
characterized in that 

said fourth DNA sequence corresponds to part or ail of the terminator of the Schwanniomyces a-amyiase 
gene or part or all of the terminator of the Schwanniomyces glucoamylase gene. 

15. The yeast ceil according to claim 14, 
characterized in that 

said terminator is comprised by nucleotides 1537 to 1740 of the Schwanniomyces cr-amylase gene or 
nucleotides 2875 to 3320 of the Schwanniomyces glucoamylase gene. 

16. The yeast ceil according to any of claims 1 to 15, 

characterized In that B ^. 

said first and/or second and/or third and/or fourth DNA sequence has been modified by insertion or deletion 
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or substitution of one or more nucleotides while the biological activity of the DNA sequence is retained or 
improved. 

17. The yeast ceil according to any of claims 1 to 16, 
characterized in that 

s said expression cassette is carried by a circular or linear vector, which optionally comprises one or more 
selective marker genes. 

18. The yeast cell according to claim 17, 
characterized in that 

said vector comprises any of the following selective marker genes : 
io TRP5, LEU2, ADE1, ADE2, HIS3, HIS4, URA3, LYS2, preferably obtained from Saccharomyces , Hansenula 
Pichia or Schwanniomyces , AMY1 or GAM1, preferably obtained from Schwanniomyces . 

19. The yeast cell according to any of claims 17 and 18, 
characterized In that 

said vector further comprises a DNA sequence capable of controlling autonomous replication and stable 
75 maintenance of the vector in the host organism. 

20. The yeast cell according to claim 19 
characterized In that 

said DNA sequence is derived from the genome of a member of the genera Saccharomyces , Pichia , 
Hansenuia or Kluyveromyces , preferably Schwanniomyces . 
20 21 . A yeast cell according to any of claims 19 or 20, 
characterized In that 

said DNA sequence is selected from the autonomously replicating sequences ARS, preferably an ARS 
derived from the respective host organism. 

22. The yeast cell according to claim 21 , 
25 characterized in that 

said DNA sequence is SwARSI or SwARS2, capable of controlling autonomous replication and stable 
maintenance of the vector in Schwanniomyces . 

23. The yeast cell according to claim 22, 
characterized In that 

30 said DNA sequences are comprised on a Clal-BamHI fragment (SwARSI) or a EcoRI-BamHi-fragment 
(SwARS2) of Schwanniomyces genomic DNA, 

24. The yeast cell according to any of claims 19 to 23, 
characterized In that 

said DNA sequence has been modified by deletion, insertion or substitution of one or more nucleotides, 
35 while retaining or improving its biological function. 

25. The yeast cell according to any of claims 1 to 24, 
characterized in that 

said yeast cell comprises any DNA-sequence homologous to genomic Schwanniomyces DNA, preferably 
homologous DNA-sequences flanking the expression cassette. 
40 26. The yeast cell according to any of claims 1 to 25, 
characterized in that 

said expression cassette is integrated into the genome, optionally in multiple copies, preferably via 
homologous recombination. 

27. The yeast cell according to any of claims 1 to 26, 
45 characterized in that 

said yeast cell is derived from the genus Schwanniomyces and is capable of glycosylation. 

28. A process for the production of a polypeptide, wherein a yeast host organism is cultivated under 
suitable conditions and the polypeptide is recovered in a manner known per se, 

characterized in that 
so said yeast host organism is a yeast cell according to any of claims 1 to 27. 

29. The process according to claim 28 
characterized in that 

yeast cells of the species Schwanniomyces occidentalis are used as a host organism and are preferably 
cultivated on starch.dextrin, maltose and/or plant biomass. 
55 30. The process according to any of claims 28 or 29, 
characterized In that 

yeast cells of the genus Schwanniomyces are used for the production of single cell protein. 
31. The process according to any of claims 28 to 30, 
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characterized in that 

said polypeptide is a-amylase or glucoamylase. 

32. The process according to claim 28, 
characterized in that 

5 the signal sequence of a yeasta-amylase or glucoamylase gene is used for the production and secretion of 
foreign protein in yeast of the genera Saccharomyces , Kluyveromyces , Hansenula , Pichia or Schizosac- 
charomyces . 

33. Use of a yeast ceil according to any of claims 1 to 27 for transformation and expression of foreign 
genes. 
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Description 

The present invention relates to yeast cells of the genus Schwannomas and a process for the production of 

Compression of eukaryotic proteins of biotechnogical relevance there exists a variety of host systems, which are 
sele^yS^e with the specific requirements. The most frequently used host organism ,s Esche.r.ch L a.cplj, 
SSe^o^representative of prokaryotic host organisms; Saccharomy.ces cerevjsiae and animal t-ssue cuf- 
Slre used afeu^otic host cells' For certain purposes, e.g. where expression of a glycoprotein ,s desired, 
^LTeL or oLr prokaryotic organisms is no, approprete, since same lack any 
On the other h"an"d" the mass production of protein in tissue cultures still is cumbersome and expense, therefore the 
oroanlm of choicermany instances is yeast of the well characterized genus S a ccAaromy.c_es. S ce.reyis.ae « able 
^^ J^ J^?^ e.g. polypeptides like MQ1 (Kurjan and Herskowitz, 1982), ^endorphin or a-,nter- 
feron a" 1984). fowever proteins o, higher molecular weight are captured in the penplasmic space (Emr et 
al 1 9B1 ) or the cvtoplasma, thus causing decreased growth rates or even death of the cell. 

Therefore it isYn object of the present inventbn to provide an improved eukaryotic expression system, wh,ch ,s 
octionallv capable of secreting proteins encoded by a foreign DNA into the culture medium. 
^ESStfS solved by a yeas, cell of the genus Schwanniom,ces. wherein said yeast cel. conta.ns at 
least one expression cassette comprising : 

20 a) a first DNA sequence serving as a regulon, 

b) optionally a second DNA sequence coding for a signal peptide 

c) a third DNA sequence coding for a foreign protein, and 

d) optionally a fourth DNA sequence serving as a terminator, wherein the first, second and fourth DNA sequence 
are derived from yeast. 

The invention, which comprises further subjects, is now described in a more detailed manner by the following 
30 description, examples and figures. 

BRIEF DESCRIPTION OF THE FIGURES 
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Fig.1 

A: Restriction map of the AMY1 gene encoding a-amylase and its 5' and 3' noncoding region 

B- Nucleotide sequence of the AMY1 gene and its flanking region. In the 5' region the transcription i initation 
site is indicated by a dashed line above the sequence. Wfthin the structural gene the presumptive s.gnal pepti- 
mSSSJL are indicated by star-symbols. The two potentiaL gelation srtes. the cyste, .residues 
presumably forming four disulfide bonds (concluded by homology to the A, oryzae enzyme) are underlined. 

region the transcription termination signal TAG. . .TATGT. . .TTT is indicated by a solid line. 

Fig.2 

A: Restriction map of the GAM1 gene encoding glucoamylase and its 5' and 3' noncoding region 

B- Nucleotide sequence of the GAM1 gene and its flanking region. In the 5" region the transcription in itiaton 
site is indicated by a dashed line. Within the structural gene the presumptive s.gnal peptides cleavage sites 

are indicated by star-symbols. 



Fig.3 



55 



M13 Xhol. The EcoRI-Scal fragment, containing a 1 .8 kb promoter fragment and appro), OA mj> structural 
gene fragment of the a-amylase gene, was subcloned into M13mp8 previously digested w..h EcoRI and Smal. 
By site directed mutagenesis a Xhol-site was inserted just in front of the ATG codon: 
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** * 

5 1 . . TAAAATAAAAGCTCGAG A J G_AG A T J_T • 

Positions where bases have been exchanged by site directed mutagenesis, are indicated by asterisks. 

pMaGAM-B/S The Bglll fragment containing the glucoamylase gene and its 5' and 3' flanking region was sub- 
cloned into the BamHI site of vector pMAC5-8 (Kramer et al. , 1 984). By site directed mutagenesis using the gapped 
duplex DNA methode (Kramer et al., 1 984) a BamHI site and a Sail site were inserted just in front of the translation 
initiation codon ATG: 

* ****** ★* 

5 • . . CTCATG ACTGT6TCSACGGATCCAA6^JL&IJJ-LIaj-v-1-- 

Positions affected by the site specific mutagenesis are indicated by asterisks. 

YRp7aGAM The GAM gene was isolated from pMaG AM-B/S as a Sail fragment and ligated into the Xhoi site of 
M13oXho (Fig.3). The Bglll fragment harbouring the a-amylase promoter GAM gene fusion was ligated into the 
single BamHI site of the S^cereyisiae vector YRp7 resulting in plasmid YRp7aGAM. 

Construction of a occid.entalis vectors for expression of foreign genes using the promoter and the signal sequence 
of the glucoamylase gene. 

a- Insertion of the 4 0 kb EcoRI-Pvull fragment from pBRSwARSGAM, containing the complete GAM promoter 
and the first 208 bp of the GAM encoding sequence into pRR322, cleaved with EcoRI/Pvull, resulting in vector 
pBRGAM. 

b- Insertion of the 3 4 kb Pvull fragment from plasmid pCT603, containing the structural gene coding for cel- 
lulase from base pair position +112, into the Pvull site of plasmid pBRGAM, resulting in vector pBRGd. 

c Insertion of the 5.5 kb BglH-Pstl fragment from plasmid pBRGCI into BamHI/Pstl cut pCJD5-1 . The resulting 
plasmid, pMPGCI -1 contains 321 bp of the GAM 5' non coding region and the first 208 bp of the GAM coding 
region fused to position +112 of the celD gene. 

& Insertion of the 6.5 kb BamHI-Pstl fragment from plasmid pBRGCI into BamHI/Pstl cut pCJD5-1 . The re- 
sulting plasmid, pMPGCI -2 contains 1 .3 kb of the GAM 5* non coding region and the first 208 bp of the GAM 
coding region fused to position +112 of the celD. 

Promoter studies using the qualitative endoglucanase D activity assay (congo red staining) with different trans- 
formants: NGA58: pMPAGC, NGA58: pMPPGC, NGA58: pMPGCI -1, NGA58: pMPGCI -2 

a: Growth on 2% maltose 
so b: Growth on 4% glucose 

Fig 8 

Construction of a S, occjdentalis vector for expression of foreign genes under control of the ADH1 promoter. 

55 a: Insertion of the 1 .45 kb BamHI-Sall ADH1 promoter fragment of plasmid pFM2-1 into pCJD5-1 , previously 

cut with BamHI/Sall resulting in plasmid pMPADHL 

b: Insertion of the 3.2 kb Sail fragment of pJDcg-15 into the Sail site of pMPADHI, resutting in pMPAG. 
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c: Exchange of the GAM promoter in plasmid pMPGCI -2 (Fig 6d) for the ADH1 promoter from plasmid pMPAG 
by jn. vivo recombination, resulting in plasmid pMPAGC. 

Construction of a a occjde_nta_li_s vector for the expression of foreign genes under control of the PDC promoter. 

a- Insertion of the 6.0 kb Sphl-Aval fragment from plasmid pJDcg-1 5, containing the PDC promoter fused to 
the GAM gene into YRpJD2 previously cleaved with Sphl-Aval, resulting in plasmid pMPPG. 

b: Exchange of the GAM promoter in plasmid pMPGCI -2 (Fig. 6d) for the PDC promoter from plasmid pMPPG 
by in vivo recombination, resulting in plasmid pMPPGC. 

Fig. 10 

Construction of an autonomous replicating vector (containing SwARS2) for transformation of & occjd_ent%lis 
The 3.2 kb TRP5 BamHI fragment from plasmid YRpJD2 was ligated with the 5.5 kb BamHI-Bglll fragment of 
plasmid pBRSwARSGAM (Fig 6a), resulting in pMPTS2-A and pMPTS2-B. 

ABBREVIATIONS USED THROUGHOUT THE APPLICATION 

Restriction endonucleases : 
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Aa 


-Aatll 


A 


- Asp718 
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- BamHI 


Bg 


- Bglll 
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-Clal 
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- EcoRI 


EV 


- EcoRV 


H 


- Hindlll 
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-Pvull 


Ps 


-Pstl 
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-Sail 


Sc 


- Seal 


Sm 


- Smal 


Sp 


-Sphl 


X 


-Xhol 



Throughout this application various publications are referenced by mention of the first author and the year of 
publication within parentheses. Full citation of the references may be found at the end of the Specification as an annex 
listed accordingtotheiralphabeticalorderimmediatelyfollowingthespecification.Thedisclosures of these publicatons 
in their entireties are hereby incorporated by reference into this application in order to more fully describe the state of 
the art as known to those skilled therein as of the date of the invention described and claimed herein. 

The yeast cell according to the present invention is characterized by containing an expression cassette which in 
turn is composed of three to four different components, being arranged in operable linkage in order to ensure efficient 
expression of a desired polypeptide, optionally followed by secretion of same. These components are defined to be 
the following : 

a) A first DNA sequence serving as a regulon. The term "a regulon" as used throughout the application comprises 
any cis-acting DNA sequence involved in the regulation of expression of a given structural gene. The term thus 
embraces sequences preceeding a given coding sequence, for example, a promoter and sequences wh.ch are 
recognised by transcription factors or other proteins involved in transcription regulation. The "first DNA sequence 
is thus not limited to sequences corresponding to known promoters. Also included are DNA sequences corre- 
sponding to natural occuring regulons, which have undergone insertion, deletion or substitution events, but st.ll 
retain their activity as regulon. Further the term "regulon" includes DNA sequences involved in regulation of tran- 
scription, which have been synthesised chemically or obtained by methods of gene technology. 

b) An optional second DNA sequence coding for a signal peptide. The meaning of this second DNA sequence 
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includes various DNA sequences, directing the secretion of any polypeptide containing the thus encoded peptide 
at its N-terminal end. 

c) A third DNA sequence coding for a foreign protein. This coding sequence coding for the signal peptide in order 
5 to retain a proper reading frame for the third DNA sequence. The fusion thus requires application of methods or 

recombination DNA technology known to every person skilled in the art. For methods enabling to design proper 
fusions it is in general referred to Maniatis et al. (1982), wherein appropriate methods, e.g. various ways to treat 
restriction fragment ends, cloning of linker molecules and similar procedures are described in great detail. The 
foreign protein coded for by the third DNA sequences may be any protein which to express there exists some 
70 need. Since the secretion system is capable of secretion not only of small molecules, but also of large proteins, 

expression of any desired protein should be possible. For examples it is referred to the discussion below. 

d) Optionally a fourth DNA sequence serving as a terminator. As the terminator any sequence may be used, which 
efficiently terminates transcription in the respective host organism. Terminators may be for example sequences 

is prone to formation of hairpin structures and/or polyadenylation sites. In a preferred embodiment the terminator is 

derived from the same gene as the second DNA sequence, i.e. the regulon, is derived from. 

knnwn tschniaues. ODtionallv the DroDer fusion will be 

MIC? IUUI IJJWI H-M HO UIW wwi i ivm ■ • — — •— ■ ■ a — - i ■ ' 

verified for example by sequencing the boundaries. 
20 The yeast cell according to the present invention contains at least one expression cassette comprising some or 

all of the above mentioned DNA sequences. It provides a convenient tool for the industrial expression of any desired 
protein. 

In a preferred embodiment said first and/or second and/or fourth DNA sequence are derived from a gene coding 
for an amylolytic enzyme, for example a-amylase or glucoamylase, which enzymes in most organisms are induced 
25 upon contact with starch. The DNA sequences may be derived from a yeast of a member of one of the genera Debar_- 
iomyces, Saccha_rqmy_ces, Uppmyces, Pjchia., or Saccharomyces, but yeast of the genus Schwannjqmyces are pre- 
ferred. 

Yeast species of the genus Schwannjqmyces have an efficient expression and secretion apparatus, enabling them 
to grow on starch as a sole carbon source. Starch is hydrolysed to glucose by two amylolytic enzymes, secreted into 

30 the supernatant, namely a-amylase (a- 1 ,4-glucan 4-glucanohydrolase E.C.3.2.1.1 ) and glucoamylase (syn. amyloglu- 
cosidase) (a-1,4-glucanglucohydrolase, E.C.3.2.1.3.; with debranching activity E.C.3.2.1.9.). The amylolytic enzymes 
of Schwanniomyces occidentalis, formerly also called Schwannjqmyces castellij and Schwannjqmy.ces_aMuvius (Price 
etal.,~1~9~78),~a~re well documented (Oteng-Gyang etal., 1981; Sills etal., 1982, 1984a, 1984b; Wilson etal., 1982). 
The genes for a-amylase and glucoamylase from a yeast of the genus Schwanniomyces have been disclosed by 

35 the present inventors in EP-A-0260404. However, at that time the DNA or peptide sequences directing secretion have 
not been identified. Furthermore their use for the establishment of an efficient expression and secretion system in yeast 
was not known then. 

It is preferred, to use as a first DNA sequence part or all of a 1.8 kb Bglll-Xhol fragment which preceeds the 
structural gene coding for the Schwannjqmyce_s a-amylase, or part or all of a 1 .3 kb BamHI-Pvull fragment proceeding 

40 the structural gene coding for'the Schwanniomyces occidentalis glucoamylase. Further if only smaller fragments are 
to be used, it is preferred, to use a~DNA sequence comprising part or all of the DNA sequence corresponding to bases 
-1 to -540 of the region proceeding the structural gene coding for a-amylase or part or all of the of DNA sequence 
corresponding to bases -1 to -320 of the region preceding the structural gene coding for glucoamylase. The respective 
DNA fragments are fully active as promoters, inducible by starch, dextrin, maltose. 

45 it is obvious, that replacement of bases or base pairs, not involved in the transcriptional regulation, still are within 

the spirits of the present invention. Thus natural mutations or synthetic equivalents also are embraced by the present 
invention. 

In yeast cells of the genus Schwannjqmyces it is also possible to use promoters derived from yeast genes obtained 
from many other yeast genera. Preferred are regulons involved in the expression of ADH1, ADH2, PDC1, GAL1/10, 
so PGK and GAPDH, or LAC4 which are obtained from Saccha_rqmyces cerevisjae, KiTOerpmyces. Lactis or Schwann^ 
omyces occ|dentaJi§. 

Furthermore", it Is possible to use viral regulons and/or terminators, for example regulons and terminators obtained 
from Exoli T-phages. The phages possess extraordinarily strong functional sequences for performance of their lytic 
functions" In a preferred embodiment sequences of phage 17 are used, which are regulated by expression of T7 RNA 
55 polymerase. 

In order to increase the over all expression level and furthermore, in order to facilitate purification of produced 
polypeptides, it is most desirable, that the polypeptides are secreted into the culture medium. The present invention 
provides signal sequences having the invaluable advantage of being derived from genes of amylolytic and thus induc- 
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ible enzymes which upon induction have to be secreted very efficiently. A further advantage is, that induction of amy- 
£te ^^doe. not require complicated media, lacking for example a specific amino acid or phosphate, but only 
S?re^ to be transferred to a starch containing medium. Signal sequences, which are preferably used for 
secretion of large proteins include part or all of at least one of the following peptides: 

a) MetArgPheSerThrGtuGLyPheThrSerLysValValAlaALalLe 
LeuAlaPheSerArgLeuValSerAta; 



b) MetArgPheSerThrGLuGLyPheThrSerLysVaLVaLALaALalLe 
LeuALaPheSerArgLeuValSerALaGLnProIlelLePheAspMetArg; 



c) MetllePheLeuLysLeuUeLysSerlLeVaLILeGLyLeuGLyLeu 
ValSerALa; 



d) KetlLePheLeuLysLeuILeLysSerUeVallLeGlyLeuGLyLeu 
ValSerAlalleGLnAlaAlaProAla; 



e) MetltePheLeuLysLeuUeUysSerUeVallLeGLyLeuGLyteu 
ValSerAtaUeGlnALaALaProALaSerSerlleGtySerSerAlaSerALa 



in a further embodiment the above mentioned peptides are coded for by DNA sequences, which correspond to ail 
or part of the following DNA sequences: 

40 

3 ) ATGAGATTTTCAACTGAAGGATTTACAAGTAAAGTTGTTGCAGCAATTTT 
AGCATTCTCAAGATTGGTATCTGCT; 

45 



b) ATGAGATTTTCAACTGAAGGATTTACAAGTAAAGTTGTTGCAGCAATTT 
TAGCATTCTCAAGATTGGTATCTGCTCAACCGATTATTTTTGACATGAGA; 



c) ATGATTTTTCTGAAGCTGATTAAAAGTATAGTAATTGGTTTGGGATTA 
GTTAGTGCT; 
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d) ATGATTTTTCTGAAGCTGATTAAAAGTATAGTAATTGGTTTGGGATTA 
GTTAGTGCTATCCAAGCAGCCCCTGCC; 

5 

e) ATGATTTTTCTGAAGCTGATTAAAAGTATAGTAATTGGTTTGGGATTA 
GTTAGTGCTATCCAAGCAGCCCCTGCCTCTTCGATTGGATCTAGTGCTTCA 

10 GCA. 

It will be understood, that the rep&cement of amino acids by residues having comparable properties and use of 
alternative codons in compliance with the possibilities provided by the genetic code will not have any impact on the 
is yeast cell according to the present invention. 

The polypeptide to be expressed in a yeast cell according to the present invention is coded for by a th.rd DNA 
sequence Said third DNA sequence may comprise natural or synthetic DNA, even DNA containing .ntervemng se- 
quen^ 

of enormous importance for scientific or medical purposes, for example cellulase, interleave, '" terte ^;^ u '' n '^ 
20 grovrthfactor,lym P hokine,humang^ 

factors, hepatitis B surface or core antigens, viral or bacterial vaccines or human granulocyte/macrophage colony 

'^ThTyeLsTcell according to the present invention may optionally comprise a terminator as component of said 
expression cassette. Preferred terminators are derived from genes of one of the following genera : 
2S P Saccharomyces,Pichia, Hansenula, bu, preferab,ySchw a nnJ OT ,ce. Examples of preferred^ 

derived"/"™ Schwanniomyces a-amylase gene or Schwann.iomy.ces glucoamyiase gene, part or a I d WhK* my be 
used. Such legators are comprised by nucleotides 1537 to 1740 of the Schwannjomyces a-amylase gene or nu- 
cleotides 2875 to 3320 of the Schwanniomyces glucoamyiase gene. 

It is obvious, that in case oTany "of the DNA sequences mentioned modifications, which do not impair the b,olog cal 
so fU nin of the respectrve DNA fragment still are covered by the present application. There are numerous poss b.l.t.es 
tochangetheD^aequerKeem^ 

The yeast cell according to the present invention, carrying an expression cassette composed of the above dis- 
cussed components, may be carried by a circular or linear vector, which optional* comprises one or more .selective 
marker genes. The present inventors demonstrated, that a lot of marker genes so far used for transformer , of 
charom'ces, for example TRP5, LEU2, ADE1 , ADE2, HIS3, HIS4, URA3, LYS2, wh.ch may be ob tamed W£$£ 
"romyVesHansenula. Pichia or Schwanniomyces, are functional in Schwann,omy.c A s.c^.dentalis Furthermore it .s 
a^p^iE.tetoLUttira^^"to Arties of o«myla«o or gluooamylase in orter to select t ™ l ^""2^E , ^ 
The corresponding genes have been isolated and characterized. AMY1 and GAM1 are preferably obtamed from 

endogenous yeast chromosomes. Furthermore, there is however the possibility, that a DNA sequence capable^ of ^on- 
trolling autonomous replication and even also stable maintenance of the vector ,n the host organism, .s introduce hnto 
he least cell Said DNA sequences are the so called ARS sequences, capable of controlling autonomous replicate 
^S!nJ!^ in She respective host organism. Said DNA sequences are preferably derived from Saccha- 
45 romyces, Pichia Hansenula or KJjuyyeromyces, most preferred from Schwannjomyjp 

" " In" o'rder toVn'surVeYective replication in the prospective host organism, it .s preferred to use an ARS, derived from 

auton^ou^ 

so of SwARS has not yet been determined. SwARS2 is localised in the .Schwannjomyces genome in the 3 reg.on of the 
Scati of £y plasmid, exhibiting this sequence. The so far known 

sequences SwARSI and SwARS2 are comprised on a Clal-BamHI fragment (SwARSI) ad|acent to the EcoRI-frag- 
ment sho^n in Fig 2a (see Fig. 10) and/or an EcoRI-Bglll fragment contained in the EcoRI fragment shown ,n F,g 2a 

55 ^ofthe SwARS sequences it is also no. necessary, in any case to use the sequencers it occurs naturally DNA 
sequences, that have been obtained by any kind of modi.ying treatment, for example by deletion, insertion or subst - 
ttf?on of one or more nucleotides or base pairs, are still within the scope of the present invention, provided the mod,- 
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fications retain or improve the biological function of the yeast cell according to the present invention. 

In a preferred embodiment the yeast cell according to the present invention comprises a DNA sequence, which is 
homologous to genomic Schwann iomyces DNA. Provision of structures of homologous DNA enable the DNA fragment 
flanked by such homotogous "sequences to be inserted by homologous recombination. It is thus preferred to have 
homologous sequences, which do not represent any essential gene of the Schwannjqmyces genomic DNA but rather 
any gene, whose expression product may be supplemented by additives to the respective medium or were lack of the 
respective expression product does not have any serious impact on the transformed yeast cell. 

If no SwARS sequence or any equivalent sequence, for example originating from any other yeast species, is 
available the expression cassette can be used without an autonomously replicating agent. In this case the expression 
cassette tends to integrate into the genome, for example via homologous recombination. Preferably there are several 
copies inserted into one or more chromosomes. 

In this case it is preferred, to have multiple copies of the expression cassette inserted, in order to ensure simulta- 
neous transcription and translation of several copies. 

When the yeast cell according to the present invention is derived from a yeast of the gen us Schwann] omyces the 
polypeptide products observed following expression of the respective foreign gene in said yeast cells exhibit g yco- 
sylation patterns, which closely resemble or even match those of the desired eucaryotic protein. Yeast cells of the 
qen us Schwannromyces are thus preferred for expression of eucaryotic proteins. 

3 r . ,. /Man Ha i icaH in a nrnriOSR forthe DrodUCtlon Ot 

The above discussea yeast cen accoiuiny iu mo ^iBoom nivoiiiiv.1 — - — ■ — i •-■ 

a polypeptide, wherein a yeast host organism is cultivated under suitable conditions and the polypeptide is recovered 
in a manner known per se. The conditions of cultivation depend on the respective host organism used. Examples for 
cultivation conditions"for"yeast of the genus Sch wan njqmyces are provided in the examples below. The POVPeP^ 9 - 
produced by the yeast according to the present invention, is recovered by conventional purification methods, which 
depend on the properties of the expressed protein, the availability of specific absorbing agents and so on. 

In a preferred embodiment yeast cells of the species Schwannjqmy.CAS opcidentaHs are used as a host organism 
and are cultivated on starch.dextrin, maltose and/or plant biomass as the respective carbon source. As commonly 
known, these carbon sources are available at low costs. 

Furthermore, there is a possibility to use cells of the genus Schwannjqmyces for the production of single cell 
protein The single cell protein may then be subjected to any further use as it is required. 

In a preferred embodiment the polypeptide produced in a process according to the present invention is an arnylolytic 
enzyme for example an o-amylase or glucoamylase. These enzymes are commercially used to debranch and degrade 
starch, which thereupon may be further processed, for example in brewery or baking processes. There is a high need 
for the provision of starch degrading enzymes! 

In a further embodiment of the process according to the present invention production of a polypeptide may be 
achieved by providing expression cassettes as discussed above containing a yeast cc-amylase or glucoamylase signal 
sequence and inserting the respective expression cassette into a yeast of one of the genera Saccharoses, ny z 
veromyces Hansenula, Pichia or Schizosaccharomyces. Production and secretion of foreign proteins in the respective 
yeast host'o'rganism is" obtafned by applying the appropriate inductive measure, whereupon the produced prote.n is 
secreted due to the presence of the yeast cx-amylase or glucoamylase signal sequence. Since it is known from exper- 
iments performed by the present inventor, that these signal sequences are recognised in virtually any yeast genus, 
the a-amylase or glucoamylase signal sequences may be applied in many different yeast with success. Also in th.s 
case, it is preferred, to use the a-amylase or glucoamylase signal sequences derived from the respective genes of 
Schwanniomyces occidentalism 

" " " ln"s"ummary"the" yeast cells according to the present invention are well suited for transformation and expression 
of any desired foreign gene . As already outlined above, expression of procaryotic as well as eucaryotic genes is 
possible. However, for expression of eucaryotic proteins yeast cells of the genus Schwannjqmvces containing some 
or all of the DNA sequences mentioned above are the most preferred yeast cells. 

CJtAR^CJARJSATION p.F.THE a-AMYLASE AND GLUCOAMYLASE GENE EXPRESSION IN DIFFERENT YEAST 
GENe'r A " " " 

The N-termini of the deduced amino acid sequences of the a-amylase gene (Fig.lB) and glucoamylase gene (Fig. 
2B) exhibit the features of typical leader sequences of secretory proteins (von Heijne 1983, Perlman and Halvorsson 
1 983) The N-terminal amino acids of mature a-amylase secreted by S, occjde.ntalis are determined to be AspValset 
This indicates processing between Arg33 and Asp31 which is unusual and not according to the \fon Heijnes model of 
site specified for signal peptidase cleavage. This putative cleavage site was confirmed also for a-amylase expressed 
in S cerevisiae, Pichia stipidis, K. lactis, S. pombe and H. polymorpha. However, in these yeast species processing 
is afsoloundto cicur between Ala 25 and Gin 26, resulting in the N-terminal sequence GlnProlle. 

a-amylase isolated from a culture of S, occjd.entajis migrates as a single band of 55 kDa on SDS polyacrylam.de 
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aels- following Endoglucosidase H treatment and subsequent gel electrophoreses a single band of 54 kDa is o^erved 
fndfciSgZonlyo^e of the two potential^ 

thSS mannose glycosylate, as veiy often found by proteins secreted in _S_ cerevisiae is not accompanying secre- 
tion in S. occidental is. 

IDENTIFICATION OF THE TRANSCRIPTION INITATION SITES AND PROMOTER FUNCTION OF THE a-AMYLASE 
AND GLUCOAMYLASE GENE. 

By S1 mapping of polyA-mRNA (Sharp et al., 1980) the transcription initiation site of the a-amylase gene . could 
be localised to be in the region of -42 to -30 (indicated by an interrupted line above the relevant sequence in Fig.lB) 
Sc u£ ^ transcription initJon site of the glucoamy.ase gene was determined to be in the region -7 to -10 (undented 
in Fig.28). 

CONSTRUCTION OF AN a-AMYLASE - GLUCOAMYLASE HYBRID GENE 

in order to facilrtate use of the a-amylase promoter and glucoamylase structural gene in further constructions 
addi onau section sKes were inserted in front of the translation initiation codon of both genes using 

. . . . — ^ ;u„w ^ amprsham nrntonol. Correct mutaaenesis was verified by sequencing 

ac^ngToTa^ 
are shown below : 



Of.- amylase : 

** * 

5 ■ . .TAAAATAAAAGCTC6AGATG_AG.AT.TT. .. .3 ' 

Xhol 



Glucoamylase 



* ****** ** 

5' ..CTCATGACTGTGTCGACGGATCCAAG A TG_A,IIIJJ-"--^ ' " 
_Sa_l_I._E__i_!ii 

The manipulations resulted in plasmids M1 3aXho (Fig.3) and pMaGAM-B/S (Fig 4). 

Thus, both the a-amylase promoter and the structural glucoamylase gene can be nsoiaed f ™ 
and pMaG AM-B/S as suitable DN A fragments. The GAM gene was isolated from pMaGam B/S as a '^^^^ 
a^dSed into the Xhoi site of M13aho. The Bgl.l fragment harbouring the a-amylase Promoter/GAM 9 ne to 
was ifqated into the single BamHI site of the S, cerevisiae vector YRp7 (Struhl et al., 1979) resuft.ng .n plasmid 
YR P 7«GAM (Rg 5). S. cerevisiae strain YNN27 was transformed wrth this p.asmid selectmg for TRP prototrophy 
lansSants secJetVdVcrivV gfucoamylase into the culture supernatant (50 mU/m.) ind.cat.ng that the a-amylase 
promoter fragment can direct expression and secretion of foreign proteins. 

SECRETION OF a-AMYLASE AND GLUCOAMYLASE IN DIFFERENT YEASTS USING SUITABLE PROMOTERS. 

Expression of the a-amylase and glucoamylase gene and secretion of the gene products can be detected in K. 
Iacti ZSrZiro of the promoter of the a-ga.actosidase (LAC4) gene (Breunig et al.. 1984). Expression of the a- 
! S$£Z£L*m^ 0-n- and secretion of the both proteins can also be detected in S pg^e under control 
of theADHI promoter (Russel etal., 1983) or of the GAL1/10 promoters (Johnston and Davis 1984). 
After fusion with the GAL1/10 promoter (Johnston and Davis 1984), DHAS promoter (Janowicz et al., 1985), MOX 
pTomoS^^^^^^ 
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gene and secretion of both proteins can be detected in Hansejiuja eqtymorpha. 

EXPRESSION AND SECRETION STUDIES IN S. OCCJDENTALIS USING THE CELLULASE ENDOGLUCANASE D 
^E^D^F CLOSTRIDIUM JHER_MQCELLUM AS A MODEL FOR FOREIGN GENE EXPRESSION. 

As a first example for the expression o. a foreign gene in & occid.entajis, the celD gene from £ JhfflTOStgB! 
(Millet et al„ 1985, Joliff et al., 1986b), which codes for a thermostable cellulase, namely an endoglucanase (EGD) 
was used. Advantages in using this system are : 

- specific reaction of EGD, which can easily be monitored 

- a rapid qualitative assay for colonies with EGD activity (see material and methods) 

- availability of antibodies against EGD for specific detection of translation products. 

CHARACTERISATION OF THE GLU COAMYLASE PROMOTER AFTER FUSION WITH THE celD GENE 

Plasmids were constructed containing a replicon from S 4 occid.entajis (SwARSI ) (EP 
from S cT evisae (Za.kin and Yanowski.1982) as a selective marker and, in addttion. > QAKtoDge^H under 

' v . . „_.„. nifhPrnRi-Pwullfraament from piasmid pBRSwARS GAM (Fig. 

the control oi aiiieremprcwnoittia. in mom oi*»oK - - „„,4»h«v™t«riai 

Z was isolated and inserted into the 2296 bp pBR322EcoRI-Pvull fragment, containingmeampgeneandthebacte^ 

oTir resulting in piasmid pBRPGAM (Fig.6a). In addition to the P BR322 sequence the piasmid carries 3.6 kb from 
°he 5'' non cSng fegion of the GAM gene and the first 208 bp coding for the N-terminal part (includmg the signal 
sequence) o7th7g^ 

region of the celD gene missing the 5' 111 bp was inserted into the Pvull single siteof pBRPGAM resulting in pBRGCI 
6b) E. co.i transformed wKh this piasmid results in weak Carboxymethylcel.ulase (CMCase) activrty analyse by 

congo red seining (see materia, and methods) indicating that ^*f^^S7S 871 So7Z £av£ 
coli For the construction of a S. occidentalis expression vector the piasmid pCJD5-1 (EP 87110370J) was cieayeo 

"wlhBamSpTrd^ 

^upo r 1 .Fia 6c> and DMPGC1 -2 (Fig 6d), respectively. Both plasmids share an in frame fusion of GAM/celD but 
Sff P e?Se".eSh oHh' sC^arn region of the GAM gene, a occidentalis strain NGA58 (trp5 ade) isolated after 
tT^eu^TJZ^2 was transformed with these plasmids and trans.ormants selected for tryptophan 
ZSSm^ ^ means o, congo red assay for cellulase activity. Transformants produced actrve EGD 
Kr g rolTnder inLingconditions in 2% maltose (Fig 7a). By analysis ^^^^2^. 
be detected under repressed conditions in 4% glucose (Fig. 7b). NGA58:pMPGC1-1 and NGA58.pMPGC1-2 trans 
%ZS£wZ^b EGD into the culture supernatant when grown in maltose under induced condrtions. The infect 
S^^SScSl^ promoter resides on piasmid pMPGCI -2 as shown by 20 times higher actmty compared to 
Lrr P Cote7! ra gment in gene fusion of pMPGCM (table 1). However the 0.3 kb 
harbours regulatory units which lead to catabolite repression by glucose (Fig.7b . Comparison of ^ N— GAM 
seouence to the signal peptidase cleavage sites of several other secretory proteins (Von He.|ne 1983) reveals three 
oo?entS s gnal ^peSdase c.eavage sites after the amino acid residues Ala 19, Ala 25 and Ala 34. For determinate 

Howeve the N-terminus is blocked and therefore could no. be sequenced. To confirm the correct cleavage ^s.te of the 
GAM sSnarsequence in S. omental*, the regions between the potentfcl signal peptidase <?W »^ «"» 
Pvull gene fusion stte are d"eleted in the GAM/celD fusion by in vilrg mutagenes 1S to study the influence on secretion 
of active EGD. 

STUDIES OF THE USE OF SUITABLE S^CEREyiSIAE PROMOTERS IN THE EXPRESSION OF FOREIGN GENES 
IN S_ OCCJDENTALIS. 

The ADH1 promoter (Hteeman etal., 1981) was isolated as a 1.45 kb BamHI-Sall fragmentfrom piasmid P FM2-1 
(MOlLr efal 9 P a Tand I gated into piasmid pCJD5-1 (EP 87 110370.1) after cleavage with BamHI and Sal. resu ,ng 

n oSaDHI Fig 8a) The GAM gene was excised as a Sail fragment from pJDcg-15 (see below) and inserted nto 
L Se Sal s te o7pMPADH1 (Fig. 8b) resulting in piasmid pMPAG (Fig. 8b). By the means of i_n vivo recombination 

Ma et a X^aJE^r'lJ pMPGCI -2 (Fig 6d.) was replaced by the ADH1 prater o pMPAG laading 
!o iSmld pMPAGC (Fig. 8c). Piasmid pMPGC1-2 was linearized w*h BamHI; 

is localized between the TRP5 coding sequence and the glucoamylase promoter. The 2.75 kb Clal fragment at pMhwu 
c^nt« 

Ss F^in vivo recombination NGA58 was transformed with BamHI linearized pMPGC1-2 together w,th approx. 10 
fold excess of Clal fragment using the transformation protocol of Ito et al., 1983. 
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Recombinants were screened for their ability to hydrolyze CMC on 4% glucose. Under these oomMansoelD ex- 
presses repressed in informants harbouring ^circularized pMPGCt -2. From 146 transformants 7 <?«w>e*r 
Zel ng ce uLe in the presence o, 4% glucose were isolated. NGA58 containing the ^ vjvo ""rtnxM 
jMPAGCexpressesactiveEGDconstitutive^see Fig. 7).Thisresultdemonstrates that the newly constructed plasm,d 

of the SceiD ge e fusion under control of the PDC promoter (Das and Hollenberg, » 
Sphl Aval fragment from plasmid P JDcg-15 (EP 87 110370.1) carrying the glucoamylase gene fused to the PDC pro- 
moter was subcloned into the respective restriction sites of YR P JD2 (EP 87 110370.1) resulting .n plasm,d pMPPG 

(R9 F ?h vivo recombination pMPPG was cleaved with Clal and used together with BamHI linearized P^™£** 
transformation of NGA58. Homologous recombination results in plasmid pMPPGC (F* 9b) harbouring GAM«e ID 
fusion under control of the PDC promoter. This construction leads to an expression ,n Schwannomas ocSLdentahs 

d^ct^ve EGD using the qualitative con go red assay (Fig. 7). The results indicate that the 0.3 kb G ^ ^° m ^L 

leads to regulated expression, however less efficient that the 1 .3 kb fragment (for comparison see table 1 and Fig. 2A). 

the ADH1 promoter leads to efficient consttut^e expression. The expression level ,s comparable 
with that of induced expression directed by the i .3 kb GAM premolar. 

Zfact that plasmids can be constructed by in vivo recombination is a proof for homologous recombination and 
henc^offS the possibility for stable site directed integration into ^ 

industrial production of foreign proteins. nraducts in S 

The expression of the a-amylase gene and glucoamylase gene and secret™ of the both gene products m§ A 
occidental could also be achieved under control of the phosphoglycerate kinase (PGK) promote (Dobson e t a 
^T^cl^^osp^ dehydrogenase (GAPDH) promoter (Holland «VP«»** 
CUP1)prom 0 ter(Lnetal..1984,Buttetal.,19a4 ) andalcoholdehydrogenase2(A^ 
et al., 1983, Beier et al., 1985) or galactokinase (GAL1/10) promoter (Johnston and Davis. 1984). 

Construction of pJDcg-15 : 

The GAM gene was isolated as a 3.2 kb BamHI-Sall Defragment from plasmid pMaGAM B/S (Fig.4) and inserted 
into the respective restrfction sites of pfesmid P BM272 Johnston & Davis, 1 984). The PDC promoter was isolated from 
Z^Z^^n a Hoilenberg, 1 988) as a Sal.-Sph, fragment and ligated intc » »• 
of vector YRp7 (Struhl et al.. 1979). To be able to isolate the PDC1 promoter as a BamHI ragmen the 3.2 kb SaH 
Uaqment of PMaGAM B/S containing the GAM gene was fused to the PDC promoter v.a the single Sail site The 
resulg aamHI fragment containing the PDC-promoter was ligated into the single BamHI site of pBM272 containing 
the GAM gene. The final plasmid is called pJDcg-15 (Fig. 8b). 

EXPRESSION OF HUMAN GRANULOCYTE/MACROPHAGE COLONY STIMULATING FACTOR (hGM-CSF) 

The gene coding for hGM-CSF (Cantrell et al., 1985) was obtained from British Biotechnology Ltd. as ; a ^"11- 
EcoRlfragment. Position + 52 of hGM-CSFgene was fused to position. 102 of the GAM s,gnal sequence under ^ 
of the ADH1 or GAM promoter. This fusion results in a hybrid protein in which the endopept.dase cleavage seethe 
GAM signal sequence is linked with a Met included in the synthetic gene offered by Brftish Biotechnology L*£*™* 
by the ffrst amino acid of mature hGM-CSF (Ala 18). This lusion was inserted into a S pccjde.nta.is expression vector 
and transformed into NGA58, Western blot analysis revealed secretion ol hGM-CSF protein. 

EXPRESSION OF INTERLEUKINE-2 

The gene coding for IL-2 (Taniguchi et al.. 1983) was obtained from British Biotechnology Ltd.. as a Ecoffl-Hindlll 
fragment Position^ of IL-2 genewasfusedtoposit^ 

or GAM promoter. This fusion results in a hybrid protein in which the endopeptidas • f ™ S ' 9 ™ 

sequence is linked with a Met included in the synthetic gene offered by British Biotechnology Ltd. fa .Howe J 
amino acid of mature IL-2 (Ala 21). This fusion was inserted into a a pccjde.ntal.s expression vector and transformed 
into NGA58. Western blot analysis revealed secretion ol IL-2 protein. 

ISOLATION OF AN ADDITIONAL SwARS ELEMENT AND A NEW SELECTABLE MARKER 

A new SwARS sequence (SwARS2), localized downstream of the coding region of the GAM gene, could be iden- 
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tified. To analyze the function of SwARS2 a vector was constructed containing the Trps gene^ and S wARS2 in pBR322 
resulting in pMPTS2-A and pMPTS2-B. respectively (Fig. 10). For the construction of pMPTS2-A and pMPTS2-B the 
5 5 kb BamHI - Bgll fragment of pBRSwARSGAM, constructed by insertion of the EcoRI-fragment shown in F,g.2a into 
the single EcoRI site of plasmid pBRSwARSI, (disclosed in EP 87 110370.1), was isolated and ligated with 3.2 kb 
BamHI TRP5 fragment obtained from YRpJD2. High frequency transformation of Soccjd.entah.s ls obta.ned us.ng this 

^Tan additional selectable marker the LEU2 gene of & pccjde.ntaJLs was isoteted. The , LEU2 :gene, coding , for 
isopropyl malate dehydrogenase, was cloned by functional complementation of the mutant AH22 (his4leu2) (Hmnen 
et al 1 978) of S cerevisiae by transformation with a cosmid genomic library from S. occjdentalis. 

The LEU2 gene was found to reside on a 9 kb EcoRI fragment of Schwannjqmy_ces genomic DIM A; it's function 
could be abolished by insertion of the TRPS gene of S, cereyisi_ae. The disrupted L^U2 gen e w^s integrated into the 
genome of NGA58 by homologous recombination resulting in a new strain NGA561 (TRP5, ^ 
analogous experiment with the cloned HIS4 gene of a occjd.entaji.s (EP 87 110 370.1) led to a strain NGA58h (TRPS. 
HIS4, ADE). 

IMPROVEDEXPRESSION OF HETEROLOGOUS GENES USING THE T7 RNA POLYMERASE AN D T7 PROMOTER 
AND TERMINATER FOR THE EXPRESSION OF FOREIGN GENES IN _S. OCCJDENTALIS 

T7 RNA polymerase (Fuerst et al., 1986) can be expressed in _S, occjdentaljs after fusion of the structural gene 
(Dunn and Studier. 1 984) using the BamHI restriction site near the translation initation codon with suitable promoters 
asa-amylase, GAM (0.3 kb fragment) or PDC and homologous integration into the genome via any homo ogous c oned 
aene e q HIS4 LEU2 AMY1 and GAM. A suitable fragment, containing the fusion flanked by homologous cloned 
sequences can be isolated and used together with pCJD5-1 (Fig. 6c) for co-transformation of NGA58, selecting on 
YNB containing 0.5% casaminoacids. Colonies containing the integrated T7 RNA polymerase gene can be identified 
by Southern analysis after isolation of transformants mutated in the target gene. 

For the expression of foreign genes a S^occjde.ntalis expression vector was constructed conta.ning SwARS2, the 
ColE replication origin of E^coli, a suitable selective marker for a occidentals (e.g. TRPS, HIS4 LEU2) and a cassette 
containing the T7 polymerase promoter and terminator sequences isolated from plasmd pAR2529 (Fuerst et al., 1986 
separated by the polylinker sequence from M1 3mp1 9. To study the efficiency of the T7 system we expressed a GAIW 
celD fusion (GAM signal sequence and structural celD gene obtained as a BamHI-Asp 718 fragment from pMPGCI -2) 
under control of the T7 promoter by insertion into the BamHI-Asp718 site of the polylinker sequence. Us.ng this system 
active cellulase is secreted into the culture medium. 

MATERIALS. ANp_METHODS 

1. The microorganisms used are as follows : 

o 

Schwannjomyces occidentalis NGA23 : (DSM 3792) 

40 Schwannjgmy.cesjDccidentalis NGA58 

Klujyerpmyces lactis SD11 : (DSM 3795) 

Schizgs_accharomyceAr»mbe LEU1 -32, HIS5-303 (DSM 3796) 

Saccharomyces cereyisiae YNN27 (URA3-52, TRP1 -298, GAL2) 

The Esche.richia coH strain used is as follows E A col i HB101 F-. hsd S20 (r B - m B -). recA13, ara.-14, proA2., JacYJ , 
flal_K2, rpjslio (Sm0,"xy.t5, mtM , sup.E44) 

so 

2. Culture jTjedia : 

Schwanniomvces occidentalis strain NGA58 (isolated by UV mutagenesis of strain NGA23) was grown at 30° C in 
YNB7o:67% yeasfnltrogenVase without amino acids) supplemented with appropriate amino acids and with 2 A of 
55 soluble starch when indicated. . 

S cerevisiae strain YNN27 (URA3-52, TRP1-289, GAL?) (Stinchomb etal 1 980); JC Lactjs strain SD11 JRP1 (Das 
and HoTlen"bVrg","l982); S. pombe strains 'lEUI -32. HIS5-303 (obtained from W.D. Heyer): These strains were grown 
under conditions as described above for Schwannjqmy.ces occidentalis with the replacement of 2% soluble starch by 
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25 



30 



2% glucose, where indicated. iju nft . „ u e o- 

The culture media tor S. occidental and Hansenula polymorphs* were buffered with 0.2 M NaP0 4 buffer pH 6.2, 
those for & cereyisi_ae and £ factjs were buffered with 0.1 M citrate buffer pH 6 and those for a pombe with 0.05 M 

308 ThVpH optimum for the cellulase (Joliff et al., 1986a), a-amylase and glucoamylase (Wilson and Ingledew, 1982) 

' S aP E.^oli P HB101 (Bolivar et al., 1977) was grown in LB medium (Maniatis et al., 1982) with penicillin G (150 ug/ml). 

3. MisceHanepus methods 

Plasmid DNA was purified either by CsCI gradient centrifugation (Maniatis et al., 1 982) or by rapid alkaline extrac- 
tion of plasmid DNA described by Birnboim and Doly (1979). 

Yeast crude extracts were prepared by the method of Schatz (1979). Yeast minilysates were prepared by the 
method of Sherman etaL, (1983). 

DNA fragments were isolated by the "low meiting agarose" procedure as described by Gafner etaL, (1983). South- 
ern hybridization was carried out as described by Southern (1 975). 

Yeast transformation was performed according to Klebe et al. (1983) or Ito et al. (1983). The transformants or 

. . t ™^ ^roti™ ^oiinia«o nsinn fiither the aualitative conqo red assay (Teather 

iniegranis weia lw&iou iui oApioooiun cm^- o^w, wv.w. . — o - . - . 

and Wood 1 982- halo formation on carboxymethy Icellulose agar plates) or a quantitative cellulase assay according to 

Joliff et al. (1 986a). One unit (U) is defined as the amount of enzyme that released 1 umol para-nitrophenol per mm at 

60°C Western analysis was performed as described by Towbinetal. (1979). 

The transformants or integrants were tested for expression and secretion of a-amylase using either a standardized 
a-amylase enzyme test from Merck, W. Germany or a qualitative starch degradation test (halo formation on starch 
agar plates after staining with iodine), as follows : in this test the rate of formation of 2-chloro-4-nitrophenol is determined 
by photometry at 405 nm in 0.1 M potassium phosphate at 37°C. The enzyme unit (U) is defined as the amount of 
enzyme catalysing the formation of 1 Slmol 2-chloro-4-nitrophenol per min at 37 0 C. 

Glucoamylase activity was measured by a stop assay method: 
After incubation of the sample in 10% soluble starch in 0.05M KH 2 P0 4 -NaOH (pH 5.0) at 50°C the amount of glucose 
produced is determined by the glucose dehydrogenase method (system glucose Merck). The quantity of NADH formed 
is proportional to the glucose concentration. The enzyme unit (U) is defined as the amount of enzyme catalysing the 
formation of 1 umol glucose/min at 50°C. 





OD 600 nm 


EDG-activity mU/ml 


NGA58: 


2.3 
5.3 


0.73 


NGA58: 

pMPGC1-2 (1.3 kb GAM promoter) 


2.0 

7.9 


1.83 


NGA58: 


3.0 
5.9 


0.52 
1.07 


NGA58: 

pMPPGC 

P DC 1 -promoter 


2.2* 
4.5* 
2.9 
7.2 


0.64* 

0.19 

0.31 


Transformants were grown in YNB, 0.5% Casaminoacids, 2% maltose, 20 mg/l adenine and 0.2 M NaP buffer pH 6.2 



35 



40 



45 



50 



: values for transformants grown in 4% glucose 
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40 Claims 

Claims for the following Contracting States : AT, BE, CH, LI, DE, FR, GB, GR, IT, LU, NL, SE 

45 1 . A yeast cell of the genus Schwanniomyces , 
characterized in that 

said yeast cell contains at least one expression cassette comprising : 

a) a first DNA sequence serving as a regulon, 

so 

b) optionally a second DNA sequence coding for a signal peptide 

c) a third DNA sequence coding for a foreign protein, and 

ss d) optionally a fourth DNA sequence serving as a terminator, 

wherein the first, second and fourth DNA sequence are derived from yeast. 
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2. The yeast cell according to claim 1 , 
characterized in that 

said first and/or second and/or fourth DNA sequence are derived from a gene coding for an amylolytic enzyme. 

3. The yeast cell according to claim 2, 
characterized in that 

said amylolytic enzyme is an a-amylase or glucoamylase. 

4. The yeast celt according to any of claims 1 to 3, 
characterized in that 

said first and/or second and/or fourth DNA sequences are derived from a yeast of a member of the genera Debar- 
iomyces , Saccharomvcopses , Lipomyces, Pichia , or Saccharomvces , preferably Schwann iomvces. 

5. The yeast cell according to claim 4, 
characterized In that 

said yeast is Schwann iomvces occidental is. 

« jho yeast cell according to any of claims 1 to 5 ? 
characterized in that 

said first DNA sequence h rrmrri^ HK y* 1 n^^n.Yhgi-fragmflnt if derived from Schwanniomvces occidental is 
a-amylase gene, or a 1.3 kb BamHI-Pvull fragment if derived from Schwann iomvces occidentalis glucoamylase 
gene. 

7. The yeast cell according to claim 5, 
characterized in that 

said first DNA sequence is comprised by part or all of the DNA sequence corresponding to bases -1 to -540 of the 
region proceeding the structural gene coding for a-amylase or pari or all of the DNA sequence corresponding to 
bases -1 to -320 of the region preceeding the structural gene coding for glucoamylase. 

8. The yeast cell according to any of claims 1 to 5, 
characterized in that 

said first DNA sequence corresponds to one of the regulons regulating expression of: 

a) ADH1.ADH2, PDC1, GAL1/10, PGK, GAPDH, wherein the regulon is preferably obtained from Saccharo- 
mvces cerevisiae, 

b) LAC4, wherein the regulon is preferably obtained from Kluvveromyces iactis, or 

c) corresponding regulons, preferably obtained from Schwanniomyces. 

9. The yeast cell according to any of claims 1 to 8, characterized in that said second DNA sequence is coding for 
all or part of at least one of the following peptides : 

a ) Met ArgPheSerThrG luG lyPheThr SerLy sVa 1 Va lAlaAl al le 
LeuAlaPheSerArgLeuValSerAla; 



b) MetArgPheSerThrGluGlyPheThrSerLysValValAlaAlalle 
LeuAlaPheSerArgLeuValSerAlaGlnProIlellePheAspMetArg; 
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c ) Met ilePheLeuLysLeuI leLysSerlleVa II leGlyLeuGlyLeu 
ValSerAla ; 



d ) Metl lePheLeuLy sLeuI leLysSer I leVa 1 1 leGlyLeuG ly Leu 
ValSerAlalleGlnAlaAlaProAla; 



e) MetllePheLeuLysLeuIleLysSerlleVallLeGlyLeuGlyLeu 
ValSerAlalleGlnAlaAlaProAlaSerSerlleGlySerSerAlaSerAla 



10. A yeast cell according to claim 9. 
characterized In that 

the DNA sequence coding for said peptide corresponds to part or all of one of the following DNA sequences: 

a ) ATGAGATTTTCAACTGAAGGATTTACAAGTAAAGTTGTTGCAGCAATT 
TTAGCATTCTCAAGATTGGTATCTGCT , 



b) ATGAGATTTTCAACTGAAGGATTTACAAGTAAAGTTGTTGCAGCAATTT 

TAGCATTCTCAAGATTGGTATCTGCTCAACCGATTATTTTTGACATGAGA; 



c ) ATGATTTTTCTGAAGCTGATTAAAAGTATAGTAATTGGTTTGGGATTA 
GTTAGTGCT ; 



d ) ATGATTTTTCTGAAGCTGATTAAAAGTATAGTAATTGGTTTGGGATTA 
GTTAGTGCTATCCAAGCAGCCCCTGCC ; 

e ) ATGATTTTTCTGAAGCTGATTAAAAGTATAGTAATTGGTTTGGGATTA 
GTTAGTGCT AT CCAAGC AG CCCCTGCCTCTTCGATTGGATCTAGTGCTTCA 

GCA. 

11. The yeast cell according to any of claims 1 to 10, 
characterized In that 

said third DNA sequence coding for a foreign protein is a natural or synthetic DNA sequence encoding a cellulase, 
interleukin, insulin like growth factor, interferon, lymphokine, human growth factor, nerve growth factor, aprotinin, 
insulin, hirudin, hormones, blood clotting factors, hepatitis B surface or core antigens, viral or bacterial vaccines 
or human granulocyte/macrophage colony stimulating factor. 

12. The yeast cell according to any of claims 1 to 11 , 
characterized in that 



17 



EP 0 394 538 B1 



said fourth DNA sequence originates from a yeast of a member of the genera Saccharomvces, Pichia , Hansenula , 
preferably Schwanniomvces . 

13. The yeast cell according to claim 1 2, 
5 characterized in that 

said fourth DNA sequence corresponds to part or all of the terminator comprised by nucleotides 1537 to 1740 of 
the Schwanniomvces oc-amylase gene or part or all of the terminator comprised by nucleotides 2875 to 3320 of 
the Schwanniomvces glucoamylase gene. 

10 14. The yeast cell according to any of claims 1 to 1 3, 
characterized in that 

said first and/or second and/or third and/or fourth DNA sequence has been modified by insertion or deletion or 
substitution of one or more nucleotides while the biological activity of the DNA sequence is retained or improved. 

1$ 15. The yeast cell according to any of claims 1 to 14, 
characterized In that 

said expression cassette is carried by a circular or linear vector, which optionally comprises one or more selective 
marker genes. 

20 1 6. The yeast cell according to claim 1 5, 
characterized in that 

said vector comprises any of the following selective marker genes : TRP5, LEU2, ADE1, ADE2, HIS3, HIS4, URA3, 
LYS2, preferably obtained from Saccharomvces, Hansenula Pichia or Schwanniomvces , AMY1 or GAM1, prefer- 
ably obtained from Schwanniomvces. 

25 

17. The yeast cell according to any of claims 15 and 16, 
characterized in that 

said vector further comprises a DNA sequence capable of controlling autonomous replication and stable mainte- 
nance of the vector in the host organism. 

30 

18. The yeast cell according to claim 17 
characterized in that 

said DNA sequence is derived from the genome of a member of the genera Saccharomvces , Pichia , Hansenula 
or Kluweromvces , preferably Schwanniomvces . 

35 

19. A yeast cell according to any of claims 17 or 18, 
characterized in that 

said DNA sequence is selected from the autonomously replicating sequences ARS, preferably an ARS derived 
from the respective host organism. 

40 

20. The yeast cell according to claim 19, 
characterized In that 

said DNA sequence is SwARSI or SwARS2, capable of controlling autonomous replication and stable maintenance 
of the vector in Schwanniomvces , wherein SwARSI is comprised on a Clal-BamHI fragment adjacent to the EcoRI 
45 fragment shown in Fig. 2a, and SwARS2 is comprised on an EcoRI-Bglll fragment contained in the EcoRI fragment 

shown in Fig. 2a. 

21. The yeast cell according to any of claims 17 to 20, 
characterized In that 

so said DNA sequence has been modified by deletion, insertion or substitution of one or more nucleotides, while 

retaining or improving its biological function. 

22. The yeast cell according to any of claims 1 to 21 , 
characterized In that 

55 said expression cassette is integrated into the genome, optionally in multiple copies, preferably via homologous 

recombination. 

23. A process for the production of polypeptide, wherein a yeast host organism is cultivated under suitable conditions 
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and the polypeptide is recovered in a manner known per se, 
characterized in that 

said yeast host organism is a yeast cell according to any of claims 1 to 22. 

24. The process according to claim 23 
characterized in that 

yeast cells of the species Schwann iomyces occidentalis are used as a host organism and are preferably cultivated 
on starch dextrin, maltose and/or plant biomass. 

25. The process according to any of claims 23 or 24, 
characterized in that 

yeast cells of the genus Schwanniomvces are used for the production of single cell protein. 

26. The process according to any of claims 23 to 25, 
characterized In that 

said polypeptide is amylase or glucoamylase. 

27. Use of a yeast ce!! according to any of claim 1 to 22 for transformation and expression of foreign genes. 



Claims for the following Contracting State: ES 

1 . A process for preparing a yeast cell capable of expressing foreign protein, 
characterized in that 

at least one expression cassette comprising: 

a) a first DNA sequence serving as a regulon, 

b) optionally a second DNA sequence coding for a signal peptide 

c) a third DNA sequence coding for a foreign protein, and 

d) optionally a fourth DNA sequence serving as a terminator, 

is inserted into a yeast cell of the genus Schwanniomvces, wherein the first, second and fourth DNA sequence 
are derived from yeast. 

2. The process according to claim 1 , 
characterized In that 

said first and/or second and/or fourth DNA sequence are derived from a gene coding for a amylolytic enzyme. 

3. The process according to claim 2, 
characterized in that 

said amylolytic enzyme is an a-amylase or glucoamylase. 

4. The process according to any of claims 1 to 3, 
characterized in that 

said first and/or second and/or fourth DNA sequences are derived from a yeast of a member of the genera Debar- 
iomyces, Saccharomvcopses , Lipomvces , Pichia , or Saccharomvces , preferably Schwanniomvces. 

5. The process according to claim 4, 
characterized in that 

said yeast is Schwanniomvces occidentalis . 

6. The process according to any of claims 1 to 5, 
characterized in that 

said first DNA sequence is comprised by a 1 .8 kb Bglll-Xhol-fragment, if derived from Schwanniomvces occidental is 
a-amylase gene, or a 1.3 kb Bam HI-Pvull fragment if derived from Schwanniomvces o ccidentalis glucoamylase 
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gene. 



7. The process according to claim 5 P 
characterized in that 

5 said first DNA sequence is comprised by part or all of the DNA sequence corresponding to bases -1 to -540 of the 

region proceeding the structural gene coding for a-amylase or part or all of the DNA sequence corresponding to 
bases -1 to -320 of the region proceeding the structural gene coding for glucoamylase. 

8. The process according to any of claims 1 to 5, 
10 characterized in that 

said first DNA sequence corresponds to one of the regulons regulating expression of: 

a) ADH1.ADH2, PDC1, GAL1/10, PGK, GAPDH, wherein the regulon is preferably obtained from Saccharo- 
mvces cerevisiae, 

15 

b) LAC4, wherein the regulon is preferably obtained from Kluweromvces lactis, or 

i -~ui, , «u*-.ir»o/H frrvm Qrh\wanninmvrflS 

c) corresponaing reyuiuns, yiaioiauiy vuic*...w~ ..w... »»-■!.■*.-_--: 

20 9. The process according to any of claims 1 to 8, 
characterized in that 

said second DNA sequence is coding for all or part of at least one of the following peptides : 

25 a) MetArgPheSerThrGluGlyPheThrSerLysValValAlaAlalle 

LeuAlaPheSerArgLeuValSerAla ; 

30 b) MetArgPheSerThrGluGlyPheThrSerLysValValAlaAlalle 

LeuAlaPheSerArgLeuValSerAlaGlnProIlellePheAspMetArg; 



35 



40 



c ) Met I lePheLeuLy sLeuI 1 eLy s Ser I leva II leGly LeuG lyLeu 
ValSerAla ; 

d) MetllePheLeuLysLeuIleLysSerlleVallleGlyLeuGlyLeu 
ValSerAlalleGlnAlaAlaFroAla ; 



« e) MetllePheLeuLysLeuIleLysSerlleVallLeGlyLeuGlyLeu 

ValSerAlalleGlnAlaAlaProAlaSerSerlleGlySerSerAlaSerAla 



so 10. A process according to claim 9, 

tte°D^ for said peptide corresponds to part or all of one of the following DNA sequences: 

S5 a ) ATGAGATTTTCAACTGAAGGATTTACAAGTAAAGTTGTTGCAGCAATT 

TTAGCATTCTCAAGATTGGTATCTGCT , 
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b) ATGAGATTTTCAACTGAAGGATTTACAAGTAAAGTTGTTGCAGCAATTT 
TAGCATTCTCAAGATTGGTATCTGCTCAACCGATTATTTTTGACATGAGA; 

c) ATGATTTTTCTGAAGCTGATTAAAAGTATAGTAATTGGTTTGGGATTA 
GTTAGTGCT ; 

d ) ATGATTTTTCTGAAGCTGATTAAAAGTATAGTAATTGGTTTGGGATTA 
GTT AGTG CT AT C C AAG C AG C C CCTGCC ; 

€^ ATGATTTTTCTGAAGCTGATTAAAAGTATAGTAATTGGTTTGGGATTA 

GTTAGTGCTATCCAAGCAGCCCCTGCCTCTTCGATTGGATCTAGTGCTTCA 
GCA. 

11. The process according to any of claims 1 to 10, 
characterized In that 

said third DNA sequence coding for a foreign protein is a natural or synthetic DNA sequence encoding a cellulase, 
interleukin , insulin like growth factor, interferon, lymphokine, human growth factor, nerve growth factor, aprotinin, 
insulin, hirudin, hormones, blood clotting factors, hepatitis B surface or core antigens, viral or bacterial vaccines 
or human granulocyte/macrophage colony stimulating factor. 

12. The process according to any of claims 1 to 11 , 
characterized in that 

said fourth DNA sequence originates from a yeast of a member of the genera Saccharomyces, Pichia, Hansenula, 
preferably Schwann iomyces . 

13. The process according to claim 12, 
characterized in that 

said fourth DNA sequence corresponds to part or all of the terminator comprised by nucleotides 1537 to 1740 of 
the Schwanniomyces a-amylase gene or part or all of the terminator comprised by nucleotides 2875 to 3320 of 
the Schwanniomyces glucoamylase gene. 

14. The process according to any of claims 1 to 13, 
characterized in that 

said first and/or second and/or third and/or fourth DNA sequence has been modified by insertion or deletion or 
substitution of one or more nucleotides while the biological activity of the DNA sequence is retained or improved. 

15. The process according to any of claims 1 to 14, 
characterized In that 

said expression cassette is carried by a circular or linear vector, which optionally comprises one or more selective 
marker genes. 

16. The process according to claim 15, 
characterized in that 

said vector comprises any of the following selective marker genes : TRP5, LEU2, ADE1, ADE2, HI S3, HIS4, URA3, 
LYS2, preferably obtained from Saccharomyces , Hansenula Pichia or Schwanniomyces , AMY1 or GAM1 , prefer- 
ably obtained from Schwanniomyces . 

17. The process according to any of claims 1 5 and 1 6, 
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characterized In that . 

said vector further comprises a DNA sequence capable of controlling autonomous replication and stable mainte- 
nance of the vector in the host organism. 

18. The process according to claim 17 

characterized in that . 

said DNA sequence is derived from the genome of a member of the genera Saccharomyces, Pichia, Hansenula 
or Kluweromvces , preferably Schwanniomvces. 

19. A process according to any of claims 17 or 18, 

characterized in that . 
said DNA sequence is selected from the autonomously replicating sequences ARS, preferably an ARS derived 

from the respective host organism. 

20. The process according to claim 1 9, 
characterized in that 

said DNA sequence is SwARSI or SwARS2, capable of controlling autonomous replication and stable maintenance 

o-i ; — ^h^roin RwARfii \r rnmnrised on a Clal-BamHi fragment adjacent to the EcoRI 

fragment shown in Fig. 2a, and SwARS2 is comprised on an EcoRl-Bglll fragment contained in the EcoRI fragment 
shown in Fig 2a. 

21. The process according to any of claims 17 to 20, 
characterized in that 

said DNA sequence has been modified by deletion, insertion or substitution of one or more nucleotides, while 
retaining or improving its biological function. 

22. The process according to any of claims 1 to 21 , 
characterized in that 

said expression cassette is integrated into the genome, optionally in multiple copies, preferably via homologous 
recombination. 

23. A process for the production of polypeptide, wherein a yeast host organism is cultivated under suitable conditions 
and the polypeptide is recovered in a manner known per se, 

characterized in that 

said yeast host organism is a yeast cell according to any of claims 1 to 22. 

24. The process according to claim 23 

characterized in that .iw^h 
yeast cells of the species firhwanniomvces occidentalis are used as a host organism and are preferably cultivated 

on starch dextrin, maltose and/or plant biomass. 

25. The process according to any of claims 23 or 24, 
characterized in that 

yeast cells of the genus Schwanniomvces are used for the production of single cell protein. 

26. The process according to any of claims 23 to 25, 
characterized In that 

said polypeptide is amylase or glucoamylase. 

27. Use of a yeast cell according to any of claim 1 to 22 for transformation and expression of foreign genes. 

Patentanspruche 

Patentanspruche fur folgende Vertragsstaaten: AT, BE, CH, LI, DE, FR, GB, GR, IT, LU, NL, SE 

1. Hefezelle der Gattuno Schwanniomvces, dadurch gekennzeichnet, daG die Hefezelle mindestens eine Expressi- 
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onskassette enthalt, umfassend: 

a) eine erste DNA-Sequenz, die als Regulon dient, 
5 b) gegebenenfalls eine zweite DNA-Sequenz, die fur ein Signalpeptid kodiert, 

c) eine dritte DNA-Sequenz, die fur ein Fremdprotein kodiert, und 

d) gegebenenfalls eine vierte DNA-Sequenz, die als ein Terminator dient, 

10 

wobei die erste, zweite und vierte DNA-Sequenz von Hefe abgeleitet sind. 

2. Hefezelle nach Anspruch 1 , dadurch gekennzeichnet, daB die erste und/oder zweite und/oder vierte DNA-Sequenz 
von einem fur ein amyiolytisches Enzym kodierenden Gen abgeleitet sind. 

1S 

3. Hefezelle nach Anspruch 2, dadurch gekennzeichnet, daB das amylolytische Enzym eine ot-Amylase Oder Gluco- 
amylase ist. 

4. Hefezelle nach einem der Anspruche 1 bis 3, dadurch gekennzeichnet, daB die ersten und/oder zweiten und/oder 
20 vierten DNA-Sequenzen von einer Hefe eines Angehorigen der Gattunaen Debariomvces , Saccharomvcopses, 

Lipomvces , Pichia oder Saccharomyces, bevorzugt Schwanniomyces , abgeleitet sind. 

5. Hefezelle nach Anspruch 4, dadurch gekennzeichnet, daB die Hefe Schwanniom yces occidentals ist. 

25 6. Hefezelle nach einem der Anspruche 1 bis 5, dadurch gekennzeichnet, daB die erste DNA-Sequenz von einem 
1,8 kb Bglll-Xhol-Fragment umfaBt ist, wenn sie vom Schwanniomyces occidentalis -a-Amylasegen abgeleitet ist, 
oder von einem 1,3 kb Bam HI-Pvull-Fragment, wenn sie von einem Schwanniomyces occidentaiis-Glucoamyla- 
segen abgeleitet ist. 

30 7. Hefezelle nach Anspruch 5, dadurch gekennzeichnet, daB die erste DNA-Sequenz von einem Teil oder der ge- 
samten DNA-Sequenz umfaBt ist, die den Basen -1 bis -540 des Bereiches entspricht, der dem fur a-Amylase 
kodierenden Strukturgen vorangeht, oder einem Teil oder der Gesamtheit der DNA-Sequenz, die den Basen -1 
bis -320 des Bereiches entspricht, der dem fur Glucoamylase kodierenden Strukturgen vorangeht. 

35 8. Hefezelle nach einem der Anspruche 1 bis 5, dadurch gekennzeichnet, daB die erste DNA-Sequenz einem der 
Regulons entspricht, die die Expression regulieren von 

a) ADH1, ADH2, PDC1, GAL1/10, PGK, GAPDH, wobei das Regulon bevorzugt von Saccharomvces cerevi- 
siae erhalten ist, 

40 

b) LAC4, wobei das Regulon bevorzugt von Kluvveromvces lactis erhalten ist, oder 

c) entsprechenden Regulons, bevorzugt aus Schwanniomyces erhalten. 

45 9. Hefezelle nach einem der Anspruche 1 bis 8, dadurch gekennzeichnet, daB die zweite DNA-Sequenz fur die Ge- 
samtheit oder einen Teil mindestens eines der folgenden Peptide kodiert: 

a) MetArgPheSerThrGluGlyPheThrSerLysValValAlaAlalle 
50 LeuAlaPheSerArgLeuValSerAla ; 

S5 b) MetArgPheSerThrGluGlyPheXhrSerLysValValAlaAlalla 

LeuAlaPheSerArgLeuValSerAlaGlnProIlellePheAspMetArg; 
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c) MetllePheLeuLysLeuIleLysSerlleVallleGlyLeuGlyLeu 
ValSerAla; 



d) MetllePheLeuLysLeuIleLysSerlleVallleGlyLeuGlyLeu 
ValSerAlalleGlnAlaAlaProAla ; 



e) MetllaPheLeuLysLeuIleLysSerlleVallLeGlyLeuGlyLeu 
ValSerAlalleGlnAlaAlaProAlaSerSerlleGlySerSerAlaSerAla 

■iq [-j s {Q^Qj|Q «ach Anspruch 9, dadurch gekennzeichnet, da!3 die fur das Peptid kodierende DNA-Sequenz einem Teil 
Oder der Gesamtheit einer der folgenden DNA-Sequenzen entspricht: 

a ) ATGAGATTTTCAA.CTGAAGGATTTACAAGTAAAGTTGTTGCAGCAATT 
TTAGCATTCTCAAGATTGGTAXCTGCT , 

b ) ATGAGATTTTCAACTG AAGGATTTACAAGTAAAGTTGTTGCAGCAATTT 
TAGCATTCTCAAGATTGGTATCTGCTCAACCGATTATTTTTGACATGAGA; 



C ) ATG ATTTTTCTG AAG CT G ATTAAAAGTATAGTAATTGGTTTGGGATTA 
GTTAGTGCT; 

d ) ATGATTTTTCTGAAGCTGATTAAAAGTATAGTAATTGGTTTGGGATTA 
GTTAGTGCTATCCAAGCAGCCCCTGCC ; 

e ) ATGATTTTTCTGAAGCTGATTAAAAGTATAGTAATTGGTTTGGGATTA 

GTTAGTGCTATCCAAGCAGCCCCTGCCTCTTCGATTGGATCTAGTGCXTCA 
GCA- 

11. Hefezelle nach einem der Anspruche 1 bis 10, dadurch gekennzeichnet, da3 die dritte DNA-Sequenz, die fur ein 
Fremdprotein kodiert, eine naturliche Oder synthetische DNA-Sequenz ist, die fur eine Zellulase, Interleukin, insu- 
linahnlichen Wachstumsfaktor, Interferon, Lymphokin, menschlichen Wachstumsfaktor, Nervenwachstumsfaktor, 
Aprotinin, Insulin, Hirudin, Hormone, Blutgerinnungsfaktoren, Hepatitis-B-Oberflachen- oder -Kernantigene, virale 
Oder bakterielle Impfstoffe oder humanen Granulocyten/Makrophagen-Kolonie-stimulierenden Faktor kodiert. 

12. Hefezelle nach einem der Anspruche 1 bis 11 , dadurch gekennzeichnet, daB die vierte DNA-Sequenz von einer 
Hef e eines Angehdrigen der Gattunaen Saccharomyces , Pichia , Hansenula , bevorzuat Schwanniomyces , stammt. 

13. Hefezelle nach Anspruch 12, dadurch gekennzeichnet, daB die vierte DNA-Sequenz einem Teil oder der Gesamt- 
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heit des Terminators entspricht, der von den Nukleotiden 1537 bis 1740 des Schwanniomvces-a-Amylasegenes 
umfaBt ist, Oder einem Teil oder der Gesamtheit des Terminators, der von den Nukleotiden 2875 bis 3320 des 
Schwanniomyces-Glucoamylasegenes umfaBt ist. 

14. Hefezelle nach einem der Anspruche 1 bis 13, dadurch gekennzeichnet, daB die erste und/oder zweite und/oder 
dritte und/oder vierte DNA-Sequenz durch Insertion oder Deletion oder Substitution eines oder mehrerer Nukleo- 
tide modifiziert worden ist, wahrend die biologische Aktivitat der DNA-Sequenz beibehalten oder verbessert wird. 

15. Hefezelle nach einem der Anspruche 1 bis 14, dadurch gekennzeichnet, daB die Expressionskassette von einem 
zirkularen oder linearen Vektor getragen wird, der gegebenenfalls ein oder mehrere selektive Markergene umfaBt. 

16. Hefezelle nach Anspruch 1 5, dadurch gekennzeichnet, daB der Vektor eines der folgenden selektiven Markergene 
umfaBt: 

TRP5, LEU2, ADE1, ADE2, HIS3, HIS4, URA3, LYS2, hflvnryuat erhalten von Saccharomvces , Hansenula, Pichta 
oder Schwanniomvces , AMY1 oder GAM1, bevorzugt erhalten von Schwann iomvces. 

17. Hefezelle nach einem der Anspruche 15 und 16, dadurch gekennzeichnet, daB der Vektor weiter eine DNA-Se- 
quenz umfaBt, die die autonome Replikation und stabile Beibehaltung des Vektors in dem Wirtsorganismus steuern 
kann. 

1 8. Hefezelle nach Anspruch 1 7, dadu rch gekennzeichnet, daB die DNA-Sequenz von dem G enom eines Angehorigen 
der Gattungen Saccharomvces , Pichia , Hansenula oder Kluweromyces abgeleitet ist, bevorzugt Schwannio- 
mvces. 

19. Hefezelle nach einem der Anspruche 17 oder 18, dadurch gekennzeichnet, daB die DNA-Sequenz aus den auto- 
nom replizierenden Sequenzen ARS, bevorzugt einer ARS, die von dem jeweiligen Wirtsorganismus abgeleitet 
ist, ausgewahlt ist. 

20. Hefezelle nach Anspruch 19, dadurch gekennzeichnet, daB die DNA-Sequenz SwARSI oder SwARS2 ist, die die 
autonome Replikation und stabile Beibehaltung des Vektors in Schwanniomvces steuern konnen, wobei SwARSI 
von einem Clal-BamH I -Fragment umfaBt ist, das dem in Figur 2a gezeigten EcoRI -Fragment benachbart ist, und 
SwARS2 von einem EcoRI- Bglll-Fraqment umfaBt ist, das in dem in Figur 2a gezeigten EcoRI-Fragment enthalten 
ist. 

21. Hefezelle nach einem der Anspruche 17 bis 20, dadurch gekennzeichnet, daB die DNA-Sequenz durch Deletion, 
Insertion oder Substitution eines oder mehrerer Nukleotide modifiziert worden ist, wahrend ihre biologische Funk- 
tion beibehalten oder verbessert ist. 

22. Hefezelle nach einem der Anspruche 1 bis 21 , dadurch gekennzeichnet, daB die Expressionskassette in das Ge- 
nom integriert wird, gegebenenfalls in mehreren Kopien, bevorzugt uber homologe Rekombination. 

23. Verfahren zum Erzeugen von Polypeptid, wobei ein Hefewirtsorganismus unter geeigneten Bedingungen kultiviert 
und das Polypeptid in an sich bekannter Weise gewonnen wird, dadurch gekennzeichnet, daB der Hefeorganismus 
eine Hefezelle gemaB einem der Anspruche 1 bis 22 ist. 

24. Verfahren nach Anspruch 23, dadurch gekennzeichnet, daB Hefezellen der Art Schwanniomvces occidentalis als 
ein Wirtsorganismus verwendet werden und bevorzugt auf Starke, Dextrin, Maltose und/oder Pflanzenbiomasse 
kultiviert werden. 

25. Verfahren nach einem der Anspruche 23 oder 24, dadurch gekennzeichnet, daB Hefezellen der Gattung Schwan- 
niomvces fur die Erzeugung von Einzelzellprotein verwendet werden. 

26. Verfahren nach einem der Anspruche 23 bis 25, dadurch gekennzeichnet, daB das Polypeptid Amylase oder Gly- 
coamylase ist. 

27. Verwendung einer Hefezelle nach einem der Anspruche 1 bis 22 zur Transformation und Expression von Fremd- 
genen. 
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Patentanspruchefur folgenden Vertragsstaat: ES 

1. Verfahren zum Herstellen einer Hefezelle, die Fremdprotein exprimieren kann, dadurch gGkennzeichnet, daB min- 
destens eine Expressionskassette in eine Hefezelle der Gattuna Schwanniomvces insertiert wird, wobei die Ex- 

s pressionskassette umfaBt: 

a) eine erste DNA-Sequenz, die als Regulon dient, 

b) gegebenenfalls eine zweite DNA-Sequenz, die fur ein Signalpeptid kodiert, 

10 

c) einedritte DNA-Sequenz, die fur ein Fremdprotein kodiert, und 

d) gegebenenfalls eine vierte DNA-Sequenz, die als Terminator dient, 
is wobei die erste, zweite und vierte DNA-Sequenz von Hefe abgeleitet sind. 

2. Verfahren nach Anspruch 1 , dadurch gekennzeichnet, daB die erste und/oder zweite und/oder vierte DNA-Sequenz 
von einem fur ein amylolytisches Enzym kodierenden Gen abgeleitet sind. 

20 3. Verfahren nach Anspruch 2, dadurch gekennzeichnet, daB das amylolytische Enzym eine a-Amylase oder Glu- 
coamyiase ist. 

4. Verfahren nach einem der Anspruche 1 bis 3, dadurch gekennzeichnet, daB die ersten und/oder zweiten und/oder 
vierten DNA-Sequenzen von einer Hefe eines Angehdrigen der Gattungen Debariomvces , Saccharomycopses , 

25 Lipomvces , Pichia oder Saccharomyces, bevorzugt Schwanniomvces, abgeleitet sind. 

5. Verfahren nach Anspruch 4, dadurch gekennzeichnet, daB die Hefe Schwanniomvces occidentalis ist. 

6. Verfahren nach einem der Anspruche 1 bis 5, dadurch gekennzeichnet, daB die erste DNA-Sequenz von einem 
30 1,8 kb Bglll-Xhol-Fragment umfaBt ist, wenn sie vom Schwanniomvces occidentalis -a-Amylasegen abgeleitet ist, 

oder von einem 1,3 kb BamHI-Pvull-Fragment umfaBt ist, wenn sie vom Schwanniomvces occidentalis -Glucoa- 
mylasegen abgeleitet ist. 

7. Verfahren nach Anspruch 5, dadurch gekennzeichnet, daB die erste DNA-Sequenz von einem Teil oder der Ge- 
35 samtheit der DNA-Sequenz umfaBt ist, die den Basen -1 bis -540 des Bereiches entspricht, der dem fur a-Amylase 

kodierenden Strukturgen vorangeht, oder einem Teil oder der Gesamtheit der DNA-Sequenz, die den Basen -1 
bis -320 des Bereichs entspricht, der dem fur Glucoamylase kodierenden Strukturgen vorangeht. 

8. Verfahren nach einem der AnsprOche 1 bis 5, dadurch gekennzeichnet, daB die erste DNA-Sequenz einem der 
40 Regulons entspricht, die die Expression regulieren von 

a) ADH1, ADH2, PDC1, GAL1/10, PGK, GAPDH, wobei das Regulon bevorzuat von Saccharomyces cerevi- 
siae erhalten ist, 

45 b) LAC4, wobei das Regulon bevorzugt von Klyveromyces lactis erhalten ist, oder 

c) korrespondierenden Regulons, bevorzugt von Schwanniomvces erhalten. 

9. Verfahren nach einem der Anspruche 1 bis 8, dadurch gekennzeichnet, daB die zweite DNA-Sequenz fur die 
so Gesamtheit oder einen Teil mindestens eines der folgenden Peptide kodiert: 

a) MetArgPheSerThrGluGlyPheThrSerLysValValAlaAlalle 
LeuAlaPheSerArgLeuValSerAla ; 
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b) MetArgPheSerThrGluGlyPheThrSerLysValValAlaAlalle 
LeuAlaPheSerArgLeuValSerAlaGlnProIlellePheAspMetArg; 



c) MetllePheLeuLysLauIleLysSerlleVallleGlyLeuGlyLeu 
ValSerAla; 

d) MetllePheLeuLysLeuIleLysSerlleVallleGlyLeuGlyLeu 
ValSerAlalleGlnAlaAlaProAla ; 



e) MetllePheLeuLysLeuIleLysSerlleVallLeGlyLeuGlyLeu 
ValSerAlalleGlnAlaAlaProAlaSerSerlleGlySerSerAlaSerAla 

10. Verfahren nach Anspruch 9. dadurch gekennzeichnet, da(3 die fOr das Peptid kodierende DNA-Sequenz einem 
Teil Oder der Gesamtheit einer der folgenden DNA-Sequenzen entspricht: 

a ) ATGAGATTTTCAACTGAAGGATTTACAAGTAAAGTTGTTGCAGCAATT 
TTAGCATTCTCAAGATTGGTATCTGCT, 



b) 



ATGAGATTTTCAACTGAAGGATTTACAAGTAAAGTTGTTGCAGCAATTT 
TAGCATTCTCAAGATTGGTATCTGCTCAACCGATTATTTTTGACATGAGA; 



.GCTGATTAAAAGTATAGTAATTGGTTTGGGATTA 



c) ATGATTTTTCTGAAi 
GTTAGTGCT; 



d ) ATGATTTTTCTGAAGCTGATTAAAAGTATAGTAATTGGTrTGGGATTA 
GTTAGTGCTATCCAAGCAGCCCCTGCC ; 



e) ATGAITTTTTCTGAAGCTGATTAAAAGTATAGTAATTGGTTTGGGATTA 

GTTAGTGCTATCCAAGCAGCCCCTGCCTCTTCGATTGGATCTAGTGCTTCA 

GCA. 

11 Verfahren nach einem der Anspruche 1 bis 10. dadurch gekennzeichnet, daB die dritte DNA-Sequenz, die fur ein 
Fremdprotein kodiert, eine naturliche Oder synthetische DNA-Sequenz ist, die fOr eine Zellulase, Interleukin, insu- 
linahnlichen Wachstumsfaktor, Interferon, Lymphokin, menschlichen Wachstumsfaktor, Nervenwachstumsfaktor, 
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Aprotinin, Insulin, Hirudin, Hormone, Blutgerinnungsfaktoren, Hepatitis-B-Oberflachen-oder-Kernantigene, virale 
Oder bakterielle Impfstoffe oder humanen Granulocyten/Makrophagen-Kolonie-stimulierenden Faktor kodiert. 

12. Verfahren nach einem der Anspruche 1 bis 11 , dadurch gekennzeichnet, daB die vierte DNA-Sequenz von einer 
s Hef e eines Angehorigen der Gattungen Saccharomvces , Pichia , Hansenula , bevorzuat Schwann iomyces, stammt. 

1 3. Verfahren nach Anspruch 1 2, dadurch gekennzeichnet, daB die vierte DNA-Sequenz einem Teil oder der Gesamt- 
heit des Terminators entspricht, der von den Nukleotiden 1537 bis 1740 des Schwann iomvces -a-Amylasegenes 
umfaBt ist, oder einem Teil oder der Gesamtheit des Terminators, der von den Nukleotiden 2875 bis 3320 des 

10 Schwanniomvces -G lucoamylasegenes umfaBt ist. 

14. Verfahren nach einem der AnsprOche 1 bis 13, dadurch gekennzeichnet, daB die erste und/oder zweite und/oder 
dritte und/oder vierte DNA-Sequenz durch Insertion oder Deletion oder Substitution eines oder mehrerer Nukleo- 
tide modifiziert worden ist, wahrend die biologische Aktivitat der DNA-Sequenz beibehalten oder verbessert wird. 

15 

15. Verfahren nach einem der AnsprOche 1 bis 14, dadurch gekennzeichnet, daB die Expressionskassette von einem 
zirkularen oder linearen Vektor getragen wird, der gegebenenfalls ein oder mehrere selektive Markergene umfaBt. 

1 6. Verfahren nach Anspruch 1 5, dadurch gekennzeichnet, daB der Vektor eines der folgenden selektiven Markergene 
20 umfaBt: 

TRP5, LEU2, ADE1, ADE2, HIS3, HIS4, URA3, LYS2, bevorzugt erhalten von Saccharomvces , Hansenula, Pichia 
oder Schwanniomvces, AMY1 oder GAM1 , bevorzugt erhalten von Schwann iomyces. 

17. Verfahren nach einem der AnsprOche 15 und 16, dadurch gekennzeichnet, daB der Vektor weiter eine DNA-Se- 
25 quenz umfaBt, die die autonome Replikation und stabile Beibehaltung des Vektors in dem Wirtsorganismus steuern 

kann. 

18. Verfahren nach Anspruch 1 7, dadurch gekennzeichnet, daB die DNA-Sequenz von dem Genom eines Angehorigen 
der Gattungen Saccharomvces , Pichia , Hansenula oder Kluweromyces abgeleitet ist, bevorzugt Schwannio- 

30 myces. 

19. Verfahren nach einem der Anspruche 17 oder 18, dadurch gekennzeichnet, daB die DNA-Sequenz aus den au- 
tonom replizierenden Sequenzen ARS, bevorzugt einer ARS, die von dem jeweitigen Wirtsorganismus abgeleitet 
ist, ausgewahlt wird. 

35 

20. Verfahren nach Anspruch 19, dadurch gekennzeichnet, daB die DNA-Sequenz SwARSI oder SwARS2 ist, die die 
autonome Replikation und stabile Beibehaltung des Vektors in Schwanniomvces steuern konnen, wobei SwARSI 
von einem Clal -Bam H I -Fragment umfaBt ist, das dem in Figur 2a gezeigten EcoRI -Fragment benachbart ist, und 
SwARS2 von einem EcoRl-Bgll I -Fragment umfaBt ist, das in dem in Figur 2a gezeigten EcoRI -Fragment enthalten 

40 ist. 

21. Verfahren nach einem der AnsprOche 1 7 bis 20, dadurch gekennzeichnet, daB die DNA-Sequenz durch Deletion, 
Insertion oder Substitution eines oder mehrerer Nukleotide modifiziert worden ist, wahrend ihre biologische Funk- 
tion beibehalten oder verbessert wird. 

45 

22. Verfahren nach einem der AnsprOche 1 bis 21, dadurch gekennzeichnet, daB die Expressionskassette in das 
Genom integriert ist, gegebenenfalls in mehrfachen Kopien, bevorzugt Ober homologe Rekombination. 

23. Verfahren zum Erzeugen eines Polypeptides, wobei ein Hefewirtsorgantsmus unter geeigneten Bedingungen kul- 
so tiviert und das Polypeptid in an sich bekannter Weise gewonnen wird, dadurch gekennzeichnet, daB der Hefe- 

wirtsorganismus eine Hefezelle gemaB einem der Anspruche 1 bis 22 ist. 

24. Verfahren nach Anspruch 23, dadurch gekennzeichnet, daB Hefezellen der Art Schwann iomvces occidentalis als 
ein Wirtsorganismus verwendet werden und bevorzugt auf Starke, Dextrin, Maltose und/oder Pflanzenbiomasse 

ss kultiviert werden. 

25. Verfahren nach einem der Anspruche 23 oder 24, dadurch gekennzeichnet, daB Hefezellen der Gattung Schwan- 
niomvces fOr die Erzeugung von Einzelzellprotein verwendet werden. 
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26. Verfahren nach einem der Anspruche 23 bis 25, dadurch gekennzeichnet, daB das Polypeptid Amylase Oder Gly- 
coamylase ist. 

27. Verwendung einer Hefezelle gemaB einem der Anspruche 1 bis 22 zur Transformation und Expression von Fremd- 
5 genen. 



Rovondications 

10 

Revendications pour les Etats contractants suivants : AT, BE, CH, LI, DE, FR, GB, GR, IT, LU, NL, SE 

1. Cellule de levure du genre Schwanniomyces, caracterisee en ce que ladite cellule de levure contient au moins 
une cassette d'expression comprenant : 

15 

a) une premiere sequence d'ADN-servant de regulon, 

b) facultativement, une deuxieme sequence d'ADN codant pour un peptide signal 

c) une troisieme sequence d'ADN codant pour une proteine etrangere, et 

d) facultativement, une quatrieme sequence d'ADN servant de terminateur, 

20 

les premiere, deuxieme et quatrieme sequences d'ADN ttant obtenues a partir d'une levure. 

2. Cellule de levure selon la revendication 1 , caracterisee en ce que lesdites premiere et/ou deuxieme et/ou qua- 
trieme sequences d'ADN sont obtenues a partir d'un gene codant pour une enzyme amylolytique. 

25 

3. Cellule de ievure selon la revendication 2, caracterisee en ce que ladite enzyme amylolytique est une a-amylase 
ou la glucoamylase. 

4. Cellule de levure selon I'une quelconque des revendications 1 a 3, caracterisee en ce que lesdites premiere et/ 
30 ou deuxieme et/ou quatrieme sequences d'ADN sont obtenues a partir d'une levure d'un Element des genres 

Debariomyces, Saccharomycopses, Lipomyces, Pichia ou Saccharomyces, de preference Schwanniomyces. 

5. Cellule de levure selon la revendication 4, caracterisee en ce que ladite levure est Schwanniomyces occidentalis. 

35 6. Cellule de levure selon I'une quelconque des revendications 1 a 5, caracterisee en ce que ladite premiere se- 
quence d'ADN est constitute d'un fragment BgM-Xho\ de 1,8 kb, si elle est obtenue a partir du gene de la ct- 
amylase de Schwanniomyces occidentalis, ou d'un fragment BamHI-fVull de 1 ,3 kb si elle est obtenue a partir du 
gene de la glucoamylase de Schwanniomyces occidentalis. 

40 7. Cellule de levure selon la revendication 5, caracterisee en ce que ladite premiere sequence d'ADN est constitute 
en partie ou en totality dela sequence d'ADN correspondant aux bases -1 a -540 de la region precedant ie gene 
structural codant pour la a-amylase, ou en partie ou en totality de la sequence d'ADN correspondant aux bases 
-1 a -320 de ta region prectdant le gene structural codant pour la glucoamylase. 

45 a Cellule de levure selon I'une quelconque des revendications 1 a 5, caracterisee en ce que ladite premiere se- 
quence d'ADN correspond a I'un des regulons regulant I'expression : 

a) de ADH1, ADH2, PDC1, GAL1/10, PGK, GAPDH, le regulon etant de preference obtenu a partir de Sac- 
charomyces cerevisiae, 

so b) de LAC4, le regulon etant de preference obtenu a partir de Kluyveromyces lactis, ou 

c) de regulons correspondents, obtenus de preference a partir de Schwanniomyces. 

9. Cellule de levure selon I'une quelconque des revendications 1 a 8, caracterisee en ce que ladite deuxieme se- 
quence d'ADN code pour tout ou partie d'au moins I'un des peptides suivants : 

55 
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a ) MetArgPheSerThrGluG ly PheThrSerLysValValAlaAlal le 
LeuAlaPheSerArgLeuValSerAla ; 

b) MetArgPheSerThrGluGlyPheThrSerLysValValAlaAlalle 
LeuAlaPheSerArgLeuValSerAlaGlnProIlellePheAspMetArg; 

c) MetllePheLeuLysLeuIleLysSerlleVallleGlyLeuGlyLeu 
ValSerAla; 

d) MetllePheLeuLysLeuIleLysSerHeVallleGlyLeuGlyLeu 
ValSerAlalleGlnAlaAlaProAia; 

e) MetllePheLeuLysLeuIleLysSerlleVallLeGlyLeuGlyLeu 
ValSerAlalleGlnAlaAlaProAlaSerSerlleGlySerSerAlaSerAla 

10. Cellule de levure selon la revendication 9, caracterisee en ce que la sequence d'ADN codant pour ledit peptide 
correspond a une partie ou a la totalite des sequences d'ADN suivantes : 

a ) ATGAGATTTTCAACTGAAGGATTTACAAGTAAAGTTGTTGCAGCAATT 
TTAGCATTCTCAAGATTGGTATCTGCT , 

b) ATGAGATTTTCAACTGAAGGATTTACAAGTAAAGTTGTTGCAGCAATTT 
TAGCATTCTCAAGATTGGTATCTGCTCAACCGATTATTTTTGACATGAGA; 



c ) ATGATTTTTCTGAAGCTGATTAAAAGTATAGTAATTGGTTTGGGATTA 
GTTAGTGCT; 

d ) ATGATTTTTCTGAAGCTGATTAAAAGTATAGTAATTGGTTTGGGATTA 
GTTAGTGCTATCCAAGCAGCCCCTGCC ; 

e) ATGATTTTT CTGAAG CTGATTAAAAGTATAGTAATTGGTTTGGGATTA 

GTTAGTGCTATCCAAGCAGCCCCTGCCTCTTCGATTGGATCTAGTGCTTCA 

GCA. 

11 Cellule de levure selon I'une quelconque des revendications 1 a 10, caracterisee en ce que ladite troisi6me 
' sequence d'ADN codant pour une proteine etrangere est une sequence d'ADN naturelle ou synthetique codant 
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pour une cellulase, une interleukine, un facteur de croissance semblable a I'insuline, un interferon, la lymphokine, 
un facteur de croissance humain, un facteur de croissance des nerfs, I'aprotinine, I'insuline, I'hirudine, les hormo- 
nes, les facteurs de coagulation du sang, les antigenes de surface ou de coeur de I'hepatite B, les vaccins viraux 
ou bactenens, ou le facteur stimulant une colonie de granulocytes/macrophages humain. 

12 Cellule de levure selon Tune quelconque des revendications 1 a 11 , caracterisee en ce que ladite quatrieme 
sequence d'ADN provientd'une levured'un element des genres Saccharomyces, Pichia, Hansenula, de preference 
Schwanniomyces. 

10 13 Cellule de levure selon la revendication 12, caracterisee en ce que ladite quatrieme sequence d'ADN correspond 
' a une partie ou a la totalite du terminateur constitue des nucleotides 1537 a 1740 du gene de la a-amylase de 
Schwanniomyces, ou a une partie ou a la totalite du terminateur constitue des nucleotides 2875 a 3320 du gene 
de la glucoamylase de Schwanniomyces. 

is 14 Cellule de levure selon I'une quelconque des revendications 1 a 1 3, caracterisee en ce que lesdites premiere et/ 
' ou deuxieme et/ou troisieme et/ou quatrieme sequences d'ADN ont ete modifiees en inserant ou en deletant ou 
en substituant un ou plusieurs nucleotides tout en retenant ou en ameliorant I'activite biobgique de la sequence 

d'ADN. 

20 15 Cellule de levure selon I'une quelconque des revendications 1 a 1 4, caracterisee en ce que ladite cassette d'ex- 
' pression est portee par un vecteur circulaire ou lineaire, qui comprend facultativement un ou plusieurs genes de 
marqueur de selection. 

16. Cellule de levure selon la revendication 1 5, caracterisee en ce que ledit vecteur comprend Pun quelconque des 
25 genes de marqueur de selection suivants : 

TRP5 LEU2 ADE1, ADE2, HIS3, HIS4, URA3, LYS2, obtenus de preference a partir de Saccharomyces, Han- 
senula, Pichia ou Schwanniomyces, AMY1 ou GAM1 , obtenus de preference a partir de Schwanniomyces. 

17 Cellule de levure selon I'une quelconque des revendications 15 a 16, caracterisee en ce que ledit vecteur com- 
30 prend de plus une sequence d'ADN capable de controler la replication autonome et le mamtien stable du vecteur 

dans Torganisme note. 

18 Cellule de levure selon la revendication 17, caracterisee en ce que ladite sequence d'ADN est obtenue a partir 
" du genome d'un element des genres Saccharomyces, Pichia, Hansenula ou Kluyveromyces, de preference 

35 Schwanniomyces. 

19 Cellule de levure selon I'une quelconque des revendications 17 et 18, caracterisee en ce que ladite sequence 
d'ADN est choisie parmi les sequences se repliquant de facon autonome ARS, de preference une ARS obtenue 
a partir de Torganisme note respectif. 

20 Cellule de levure selon la revendication 19, caracterisee en ce que ladite sequence d'ADN est SwARSI ou 
SwARS2 capable de controler la replication autonome et le maintien stable du vecteur dans Schwanniomyces, 
dans laquelle SwARSI est constitute d'un fragment Cla\BamH\ adjacent au fragment EcoRI presente a la figure 
2a, et SwARS2 est constitute d'un fragment EccB\-Bgh\ contenu dans le fragment EcoRI presente a la figure 2a. 

21 Cellule de levure selon I'une quelconque des revendications 17 a 20, caracterisee en ce que ladite sequence 
d'ADN a ete modifiee en deletant, inserant ou substituant un ou plusieurs nucleotides, tout en retenant ou en 
ameliorant sa fonction biologique. 

so 22 Cellule de levure selon I'une quelconque des revendications 1 ft 21 , caracterisee en ce que ladite cassette d'ex- 
pression est integree dans le genome, facultativement en plusieurs copies, de preference par recombinaison no- 
mologue. 

23. Precede pour la preparation d'un polypeptide, dans lequel un organisme hote de levure est cultive dans des con- 
55 ditions appropriees et le polypeptide est recupere d'une maniere connue d'elle-meme, caracterise en ce que ledit 

organisme h6te de levure est une cellule de levure selon I'une quelconque des revendications 1 a 22. 

24. Procede selon la revendication 23, caracterise en ce que des cellules de levure de I'espece Schwanniomyces 
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occidentalis sont utilises en tant qu'organisme note et sont de preference cultivees sur de t'amidon, de la dextrine, 
du maltose et/ou de la biomasse vegetale. 

25. Procede selon Tune quelconque des revendications 23 et 24, caracterise ©n c© qu© des cellules de levure du 
s genre Schwanniomyces sont utilisees pour la production d'une seule proteine de cellule. 

26. Procede selon I'une quelconque des revendications 23 a 25, caracteris© on c© qu© (edit polypeptide est I'amylase 
ou la glucoamylase. 

10 27. Utilisation d'une cellule de levure selon I'une quelconque des revendications 1 a 22 pour la transformation et 
I'expression de genes etrangers. 

Revendications pour I'Etat contractant suivant: ES 

15 

1. Precede pour preparer une cellule de levure capable d'exprimer une proteine etrangere, caracterise en ce qu'au 
moins une cassette d'expression comprenant : 

a) une premiere sequence d'ADN-servant de regulon, 
20 b) facultativement, une deuxieme sequence d'ADN codant pour un peptide signal 

c) une troisieme sequence d'ADN codant pour une proteine etrangere, et 

d) facultativement, une quatrieme sequence d'ADN servant de terminateur, 

est inseree dans une cellule de levure du genre Schwanniomyces, les premiere, deuxieme et quatrieme sequences 
25 d'ADN etant obtenues a partir d'une levure. 

2. Procede selon la revendication 1 , caracterise en ce que lesdites premiere et/ou deuxieme et/ou quatrieme se- 
quences d'ADN sont obtenues a partir d'un gene codant pour une enzyme amylolytique. 

30 3. Procede selon la revendicatton 2, caracterise en ce que ladite enzyme amylolytique est une ct-amylase ou la 
glucoamylase. 

4. Procede selon I'une quelconque des revendications 1 a 3, caracterise en ce que lesdites premiere et/ou deuxieme 
et/ou quatrieme sequences d'ADN sont obtenues a partir d'une levure d'un element des genres Debariomycos, 

35 Saccharomycopses, Lipomyces, Pichia ou Saccharomyces, de preference Schwanniomyces. 

5. Procede selon la revendication 4, caracterise en ce que ladite levure est Schwanniomyces occidentalis. 

6. Procede selon I'une quelconque des revendications 1 a 5, caracterise en ce que ladite premiere sequence d'ADN 
40 est constitute d'un fragment Bgl\\ -Xho\ de 1 ,8 kb, si elle est obtenue a partir du gene de la a-amylase de Schwan- 
niomyces occidentalis, ou d'un fragment BamH\-Pvu\\ de 1 ,3 kb si elle est obtenue a partir du gene de la glucoa- 
mylase de Schwanniomyces occidentalis. 

7. Procede selon la revendication 5, caracterise en ce qu© ladite premiere sequence d'ADN est constitute en partie 
45 ou en totalite de la sequence d'ADN correspondant aux bases -1 a -540 de la r6gion precedant le gene structural 

codant pour la a-amylase t ou en partie ou en totalite de la sequence d'ADN correspondant aux bases -1 a -320 
de la region precedant te gene structural codant pour la glucoamylase. 

8. Procede selon I'une quelconque des revendications 1 a 5, caracterise en ce que ladite premiere sequence d'ADN 
so correspond a Tun des regulons regulant I'expression : 

a) de ADH1, ADH2, PDC1, GAL1/10, PGK, GAPDH, le regulon etant de preference obtenu a partir de Sac- 
charomyces cerevisiae, 

b) de LAC4, le regulon etant de preference obtenu a partir de Kluyveromyces lactis, ou 
ss c) de regulons correspondants, obtenus de preference a partir de Schwanniomyces. 

9. Procede selon I'une quelconque des revendications 1 a 8, caracterise en c© quo ladite deuxieme sequence d'ADN 
code pour tout ou partie d'au moins Tun des peptides suivants : 
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a) Me-tArgPheSerThrGluGlyPheThrSerLysValValAlaAlalle 
LeuAlaPheSerArgLeuValSerAla ; 

b) MeliArgPheSerThrGluGlyPheThrSerLysValValAlaAlalle 
LeuAlaPheSerArgJLeuValSerAlaGlnProIlellePheAspMetArg; 

c) MetllePheLeuLysLeuIleLysSerlleVallleGlyLeuGlyLeu 
ValSerAla; 

d) MetllePheLeuLysLeuIleLysSerlleVallleGlyLeuGlyLeu 
ValSerAlalleGlnAlaAlaProAla ; 

e ) MetillePheLeuLy sleul leLysSerl le VallLeGlyLeuGlyLeu 
ValSerAlalleGlnAlaAlaProAlaSerSerlleGlySerSerAlaSerAla 

. Precede selon la revendication 9, caracterise en ce que la sequence d'ADN codant pour ledit peptide correspond 
a une partie ou a la totalite d'une des sequences d'ADN suivantes : 

a) ATGAGATTTTCAACTGAAGGATTTACAAGTAAAGTTGTTGCAGCAATT 
TTAGCATTCTCAAGATTGGTATCTGCT , 

b ) ATGAGATTTTCAACTGAAGGATTTACAAGTAAAGTTGTTGCAGCAATTT 
TAG C ATT CT C AA G ATT G GT AT CTG CTC AA C CG ATT ATTTTTG AC ATG AG A ; 

c ) ATGATTTTTCTGAAGCTG ATTAAAAGTATAGTAATTGGTTTGGGATTA 
GTTAGTGCT; 

d) ATGATTTTTCTGAAGCTG ATTAAAAGTATAGTAATTGGTTTGGGATTA 
GTTAGTGCTATCCAAGCAGCCCCTGCC ; 

e) ATGATTTTT CTG AAG CTG ATTAAAAGTATAGTAATTGGTTTGGGATTA 
GTTAGTGCTATCCAAGCAGCCCCTGCCTCTTCGATTGGATCTAGTGCTTCA 

GCA. 
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11. Proc6d6 selon I'une quelconque des revendications 1 a 10, caracterise en ce que ladite troisieme sequence 
d'ADN codant pour une proline etrangere est une sequences d'ADN naturelle ou synthetique codant pour une 
cellulase, une interleukine, un facteur de croissance semblable a I'insuline, un interferon, la lymphokine, un tacteur 
de croissance humain, un facteur de croissance des nerfs, I'aprotinine, I'insuline, I'hirudine, les hormones, les 

5 facteurs de coagulation du sang, les antigenes de surface ou de coeur de I'hepatite B, les vaccins viraux ou bac- 

teriens, ou le facteur stimulant une colonie de granulocytes/macrophages humain. 

12. Precede" selon I'une quelconque des revendications 1 a 11, caracterise en ce que ladite quatrieme sequence 
d'ADN provient d'une levure d'un element des genres Saccharomyces, Pichia, Hansenula, de preference Schwan- 

io niomyces. 

13. Procede selon la revendication 12, caracterise en ce que ladite quatrieme sequence d'ADN correspond a une 
partie ou a la totalite du terminates constitue des nucleotides 1537 a 1740 du gene de la a-amylase de Schwan- 
niomyces, ou a une partie ou a la totalite du terminateur constitue des nucleotides 2875 a 3320 du gene de la 

is glucoamylase de Schwa nniomyces. 

1 4. Procede selon I'une quelconque des revendications 1 a 1 3, caracterise en ce que lesdites premiere et/ou deuxie- 
m 9 et/Qij troisieme et/ou quatrieme sequences d'ADN ont ete modifiees en insurant ou en deletant ou en substituant 
un ou plusieurs nucleotides tout en retenant ou en amelorant I'activite biologique de la sequence d'ADN. 

20 

15. Procede* selon I'une quelconque des revendications 1 a 14, caracterise en ce que ladite cassette d'expression 
est portee par un vecteur circulaire ou lineaire, qui comprend facultativement un ou plusieurs genes de marqueur 
de selection. 

25 16. Procede selon la revendication 1 5, caracterise en ce que ledit vecteur comprend I'un quelconque des genes de 
marqueur de selection suivants : 

TRP5, LEU2, ADE1, ADE2, H!S3, HIS4, URA3, LYS2, obtenus de preference a partir de Saccharomyces, Han- 
senula, Pichia ou Schwanniomyces, AMY1 ou GAM1 , obtenus de preference a partir de Schwanniomyces. 

30 17. Procede selon I'une quelconque des revendications 15 a 16, caracterise en ce que ledit vecteur comprend de 
plus une sequence d'ADN capable de controler la replication autonome et le maintien stable du vecteur dans 
Porganisme hdte. 

18. Proc6de selon la revendication 17, caracterise en ce que ladite sequence d'ADN est obtenue a partir du genome 
35 d'un element des genres Saccharomyces, Pichia, Hansenula ou Kluyveromyces, de preference Schwanniomyces. 

19. Procede selon I'une quelconque des revendications 17 et 18, caracterise en ce que ladite sequence d'ADN est 
choisie parmi les sequences se r6pliquant de facon autonome ARS, de preference une ARS obtenue a partir de 
I'organisme hdte respectif. 

40 

20. Procede selon la revendication 19, caracterise en ce que ladite sequence d'ADN est SwARSI ou SwARS2, 
capable de contrCler la replication autonome et le maintien stable du vecteur dans Schwanniomyces, dans laquelle 
SwARSI est constitute d'un fragment Cla\-BamH\ adjacent au fragment EcoRI presents a la figure 2a, et SwARS2 
est constitute d'un fragment EcoH\-Bgl\\ contenu dans le fragment EcoRI presente a la figure 2a. 

45 

21. Proc6d6 selon I'une quelconque des revendications 1 7 a 20, caracterise en ce que ladite sequence d'ADN a ete 
modifite en delttant, inserant ou substituant un ou plusieurs nucleotides, tout en retenant ou en am6liorant sa 
lonction biologique. 

so 22. Procede selon I'une quelconque des revendications 1 a 21 , caracterise en ce que ladite cassette d'expression 
est integree dans le genome, facultativement en plusieurs copies, de preference par recombinaison homologue. 

23. Proctde pour la preparation d'un polypeptide, dans lequel un organisme hote de levure est cuttiv6 dans des con- 
ditions appropri6es et le polypeptide est recupere d'une maniere connue d'elle-meme, caracterise en ce que ledit 

55 organisme hote de levure est une cellule de levure selon I'une quelconque des revendications 1 a 22. 

24. Procede selon la revendication 23, caracterise en ce que des cellules de levure de I'espece Schwanniomyces 
occidental sont utilisees en tant qu'organisme hote et sont de preference cultivees sur de I'amidon , de la dextrine, 
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du maltose et/ou de la biomasse vegetale. 

25 Precede selon rune quelconque des revendications 23 et 24, caracterise en ce que des cellules de tevure du 
genre Schwanniomyces sont utilisees pour la production d'une seule proteine de cellule. 

26. Procede selon I'une quelconque des revendications 23 a 25, caracterise en ce que ledit polypeptide est ('amylase 
ou la glucoamylase. 

27. Utilisation d'une cellule de levure selon I'une quelconque des revendications 1 a 22 pour la transformation et 
10 ('expression de genes etrangers. 
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